PREFACE, 1981


When I turned doctor in 1932, I was still wholly under the spell 
of Principia Mathematica. To me this great work, by my teacher 
Whitehead and his pupil Russell, was the datum plane of mathematical 
logic. Improvement upon Principia was what I thought progress in 
mathematical logic consisted in. Progress in that sense was indeed no 
great challenge. The confusion of use and mention cried out for correction,
along with the resulting muddles over propositional functions,
implication, and ramified types. Uneconomical definitions and notations
lay open for pruning. Great pioneer works are proverbially easy
to improve. All these things I appreciated at the time. It was only in 
the following year, however, in Prague and Warsaw, that my perspective
was widened by Carnap, Tarski, and others to take in what had
been happening to logic in foreign climes. 

My work in progress benefited superficially from my European exposure
but was already deeply entrenched in the Principia tradition,
where, substantially, it stayed. It was A System of Logistic, 1934.
Mathematical Logic, six years later, contrasts markedly with that 
youthful work in its sweep, independence, and maturity. Abundant 
literature is noted and heeded, and sources are traced. Exposition, 
though earnest, is fuller and more relaxed. But there are traits that 
link it still, unlike my later books, with the Principia model. Like 
Principia it develops its message in a monolithic deductive system based 
on fixed axioms and rules. Like Principia it proceeds by explicit proofs 
in a rigid notation rather than settling for sufficient hints of how such 
proofs would run. And like Principia it subsumes set theory under logic 
instead of recognizing it as a mathematical discipline beyond logic. 

Thanks to the acumen of Barkley Rosser (I do say thanks), the set 
theory of Mathematical Logic of 1940 came a cropper in 1941. The 
story is in another preface among these pages: how the publisher 
Norton put my makeshift corrigendum slip into the remaining copies 
of the book, how Harvard University Press rescued the book for a 
second printing in 1947 and printed the slip into it, and how Hao 
Wang supplanted my shotgun emendation with a proper one, consecrated
in the revised edition of 1951.

The set theory of early drafts of the book was that of my 1937 paper 
“New Foundations.” Vexatious limitations of that system were gratefully
overcome when it occurred to me to liberalize it by a device of
von Neumann’s. I overdid it, and this is what Wang brought back into 
line. Thus corrected, the system retains its superiority over that of 
“New Foundations.” But the latter retains an ancillary interest in 
view of a proof, also due to Wang, that the consistency of either system 
assures that of the other. 

There is a blemish, however, affecting this system. Rosser has 
proved that the class of all natural numbers cannot be proved in it to 
be an “element,” or set, if the system is consistent. We must acquiesce
rather in the inelegance of an added axiom.

Thus, though the system here presented has its attractions, I am not 
its single-minded champion. In Set Theory and Its Logic (Harvard, 
1963, 1969) I have compared various of the set theories in the literature,
noting the motivations and advantages of each.

In the present book I used no schemata but referred only to their 
instances, the actual sentences, replete with unspecified predicates. 
Furthermore, I did not settle for open sentences, with free variables, 
but insisted on closed sentences, true and false. My reason was that 
these are what logic is for; schemata and even open sentences are technical
aids along the way. What is actually shown, then, is that all
logical truths (in my narrow latter-day sense, that is, all quantificational
truths) issue purely by modus ponens from monadically true
ones of certain narrowly specified forms. This adherence to closed sentences
is not trivial; the adequacy of modus ponens unmediated by open
sentences takes a bit of proving (*111).

My motive for focusing on the closed sentences was, we see, philosophical:
the philosophical motive of keeping one’s eye on the ball. Now
since the sentences were built of arbitrary and unspecified predicates, I
had perpetually to write about them rather than write them. Quotation
was not enough, for it blocked generality. It was to clear this
obstacle without confusing use and mention that I devised my corners
of quasi-quotation.

Schemata could indeed have been admitted still as heuristic aids, and 
they would have made manipulations easier and more vivid; but I 
eschewed them for pedagogical reasons of a higher order. I was bent on 
combatting confusions over matters of ontology. I feared that readers
would persist in thinking of sentence letters and predicate letters as 
variables taking propositions and properties as values, protest as I 
might. My hope of keeping readers cognizant of what I was talking 
about, and what not, lay in opening no easy channel for wrong ideas. 
We can take the easier line with equanimity nowadays, for logic teachers
now tend to see these matters aright; and I like to think that
I helped. 

I like to think that I also helped in the struggle against confusions 
of use and mention. Certainly I was persistent in precept and scrupulous
in example, and happily the situation has vastly improved. Still, my
efforts against the related confusion of implication with the conditional, 
and of equivalence with what I named the biconditional, have been 
only partially successful if we may judge by lingering nomenclature. 

The final chapter had two purposes. One purpose was to show how 
the metalogical discourse about a formal system would look when formalized
in turn. The other and higher purpose was to teach a proof of
Gödel’s incompleteness theorem. My proof, unlike Gödel’s, shuns numbers
until the coup de grace. It proceeds rather in what I called protosyntax,
as a proof that protosyntactical truth is not definable in
protosyntax. Thus seen, it is a case of Tarski’s theorem that theories
meeting certain reasonable conditions cannot contain their own truth
definitions. Finally Gödel’s theorem is derived by correlating protosyntax
with number theory.

A novelty of this presentation is the device of “framed ingredient” 
for reducing sequences of expressions to expressions and thus gaining 
the effect of quantifying over sequences and finite sets. A more conspicuous
novelty is the avoidance of singular terms other than variables
and hence the absence of any appeal to substitution of such terms. Substitution
played a central role in Gödel’s constructions and Tarski’s.
Lacking it, I had to develop a method of identity matrices, as I called
them, which does not make for easy reading. This departure was a 
consequence of my policy in earlier chapters of excluding singular 
terms, other than variables, from primitive notation and admitting 
them only by contextual definition. It was a way of simplifying theory 
and also of helping to clarify ontological considerations. 

I think of 1938-39, when my writing of this book was in its 
penultimate year and I was living on a Portuguese island in the mid-Atlantic.
I worked for a while at Gödel’s proof of the completeness of
predicate logic, with a view to adapting it to my treatment of quantification
theory and including it in the book. I found it was going to make
too heavy reading to warrant inclusion. It would not have been as
heavy as my presentation of Gödel’s incompleteness theorem, but heaviness
must be offset by importance; the one theorem is of world-shaking
importance, the other less. Little did I imagine how brief and readable 
a form the completeness proof would acquire some day in Methods 
of Logic. 

Harvard University W. V. Quine 

January 1981 



PREFACE TO THE REVISED EDITION 

The revisions made in this edition have affected half the pages, 
mostly in small ways. Some thirty pages have undergone major 
changes. 

The prime mover of the most important revision was Rosser, who 
discovered, shortly after the first edition appeared, that the axioms 
of class theory in the middle of the book were contradictory. On 
learning of his discovery I arranged with the publisher to paste a 
corrigendum slip into the remaining stock of the book, indicating a 
makeshift repair of the system. The text of this slip was also inserted
in the second printing (1947). Lately, however, Wang has
devised a better repair, admirably suited both to the spirit of the 
original system and to most of the details of the original exposition. 
It has hence been possible in the present edition to put the text to 
rights and leave no scar. The revisions thus entailed have affected 
pages 157-160, 162, 163, 166, 193, 238, and 305. 

Another revision stems from Berry, who showed how a minor 
modification of the concept of closure (page 79) could yield a reduction
in the axioms of quantification. Berry’s idea was mentioned in
the second printing (page 89), but in the present edition it has been 
directly incorporated, at the cost of reworking pages 79-95 and 
300-301 and changing reference numbers and quantifiers in a hundred
pages.

Over the decade since the first edition appeared, two major difficulties
for readers have become manifest. One is the distinction between
theorems formally deduced and metatheorems informally
established. The other is the reasoning behind Gödel’s theorem of
incompletability. These two difficulties have been alleviated in the 
present edition, the one by the addition of an appendix and the 
other by the insertion of some supplementary exposition (pages 
307-312). 

In the treatment of real numbers two subtle errors, discovered by 
Mr. Ralph M. Krause and Dr. Wang, have been eradicated by 
revising pages 273-275. Other emendations and additions on miscellaneous
topics have been made on pages 25, 81, 89, 144, 177f,
191, 202, 208f, 229f, 292, 299, and 318. 

Cambridge, Mass. 

January 28, 1951.

Of that which receives precise formulation in mathematical
logic, an important part is already vaguely present as a basic 
ingredient of daily discourse. The passage from non-mathematical, 
non-philosophical common sense to the first technicalities of 
mathematical logic is thus but a step, quickly taken. Once within 
the field, moreover, one need not travel to its farther end to reach 
a frontier; the field is itself a frontier, and investigators are active 
over much of its length. Even within an introductory exposition 
there is room for novelties which may not be devoid of interest to 
the specialist. Textbook and treatise can hence be combined 
within the same covers in mathematical logic as in few other fields. 
Such is the intended status of the present book. 

The material presented is substantially that covered in my course 
Mathematics 19 at Harvard. I have undertaken to present it in 
such a way as to presuppose neither previous acquaintance with 
mathematical logic nor any special training in mathematics or 
philosophy. But the book is intended for the serious reader; 
there are sections which demand close study. Rigor has not, in 
general, been consciously compromised in favor of perspicuity. 

Chapter I deals with the truth-functional composition of statements:
the modification of statements by denial, and the formation
of compounds by such connectives as ‘and’, ‘or’, ‘if-then’.
These are reduced to the single connective of joint denial, ‘neither-nor’,
following Sheffer. The contrast is emphasized between use
of expressions and discourse about expressions, and the controversy 
over implication is considered in the light of this distinction. A 
metamathematical, or syntactical, notation is introduced to facilitate
discourse about statements and other expressions; and the
principles of statement composition are expounded in these terms,
without recourse to the so-called propositional variables ‘p’, ‘q’,
etc. of earlier works. The property of tautology — logical truth of 
the sort which depends only on truth-functional composition — 
receives a new formulation in metamathematical terms, and a series 
of tautologous forms is adduced. The deductive method is dispensed
with, at the level of tautology, in favor of the method of
tabular calculation. 

Chapter II treats of quantification: the formal correspondent of 
the idioms involving ‘all’ and ‘some.’ It is in connection with 
quantification that the variable makes its first appearance; and 
accordingly some attention is devoted to clarifying the nature of 
the variable, and exhibiting its analogy to the pronoun of ordinary 
language. Quantification theory, like the preceding part of logic, 
is expounded within the medium of metamathematics; its presentation
in any other medium appears disadvantageous, indeed,
because of subtleties having to do with the so-called bound and 
free occurrences of variables. Here as in Chapter I, methods of 
calculation are deemed preferable to methods of deduction; but a 
compromise now has to be struck, because of the fact that no 
method of calculation is adequate to the truths of quantification 
theory. The compromise consists in specifying an infinite set of 
so-called axioms of quantification by a method of calculation, and 
then allowing the theorems to proceed from these by one deductive
rule of the simplest sort: modus ponens. A further departure,
which recommends itself as more intuitive than the usual
procedure, is this: the axioms of quantification are specified in 
such a way that the theorems come to include only statements in 
the strict sense — formulae containing no “free” variables, no 
variables beyond those used in quantification. 

In Chapter III the connective ‘ϵ’ of membership emerges, and
therewith classes. This connective, taken as supplementary to 
quantification and joint denial, proves to afford an adequate basis 
for the definition of the notions of logic and those of arithmetic and 
derivative disciplines as well. An essential step in the series of 
definitions is the introduction of abstraction — the notation 
whereby classes are specified through conditions on their members. 
Its introduction is accomplished by contextual definition. On the 
basis of abstraction in turn the device of description is introduced, 
which corresponds to the idiom ‘the entity such that . . .’. It 
now becomes possible to construe names in general, abstract and 
concrete alike, as mere shorthand which is eliminable at will from 
all discourse. Besides the economy which it affords, this procedure
proves to have the value of effecting a cleavage between syntactical 
questions of meaningfulness and factual questions of existence. 

In Chapter IV the axioms of quantification are supplemented 
with a new infinity of axioms, providing the essential properties of 
membership and derivative notions; but no rule of inference is 
needed beyond the original modus ponens. Identity and unit 
classes are defined, and likewise the notions of the ordinary algebra 
of classes; and a train of theorems is derived. 

The logical antinomies, e.g. Russell’s, are avoided by a method 
which is less restrictive than past methods such as the theory of 
types. The method turns, like von Neumann’s, on construing 
certain entities as incapable of membership; but the entities so 
construed are far scarcer than under von Neumann’s theory. 
Besides making for algorithmic facility, this liberalization turns 
out to render the existence of infinite classes demonstrable without 
special postulation. 

In Chapter V relations are defined on the basis of class theory, 
by the Wiener-Kuratowski method, and the usual concepts of 
relation theory are introduced and investigated. Functions, in 
turn, are defined on the basis of relation theory. Analogues of the 
device of abstraction which was used in class theory reappear in 
connection with relations and functions. The definition of natural 
number comes in Chapter VI; a subsequent definition then introduces
the operation of raising a relation to a numerical power,
which operation is found to provide a simple and uniform means 
of defining the various operations of arithmetic. Certain of the 
basic theorems of arithmetic are proved, and the standard constructions
which lead from this level into higher reaches of quantitative
mathematics are briefly outlined.

In Chapter VII we turn to the formalization of the metamathematical
or syntactical machinery involved in discourse about a
formalism such as presented in the foregoing chapters. Gödel’s
theorem regarding the incompletability of logic and arithmetic is 
derived along novel lines, and its scope is somewhat extended. 

Cambridge, Mass. 

April 7, 1940.



MATHEMATICAL LOGIC


INTRODUCTION 

Mathematical logic differs from the traditional formal logic
so markedly in method, and so far surpasses it in power and subtlety, 
as to be generally and not unjustifiably regarded as a new science. 
Its crude beginnings are placed with George Boole, in the middle of 
the last century. Fragments foreshadowing mathematical logic 
date back much farther than Boole — as far back indeed as Leibniz; 
but it was from Boole onward through Peirce, Schröder, Frege,
Peano, Whitehead, Russell, and their successors that mathematical 
logic underwent continuous development and reached the estate of 
a reputable department of knowledge. 

The traditional formal logic, dating in its essentials from Aristotle,
is nevertheless the direct progenitor of mathematical logic.
The striking differences between the two must not be allowed to 
obscure the fact that they are both “logic” in the strictest sense of 
the word. They both have, vaguely speaking, the same subject 
matter. Just what that subject matter is, it is not easy to say; 
the usual characterizations of logic as “the science of necessary 
inference”, “the science of forms”, etc., are scarcely informative 
enough to be taken as answers. 

But if we shift our attention from subject matter to vocabulary, 
it is easy to draw a superficial distinction between the truths of 
logic and true statements of other kinds. A logically true statement
has this peculiarity: basic particles such as ‘is’, ‘not’, ‘and’,
‘or’, ‘unless’, ‘if’, ‘then’, ‘neither’, ‘nor’, ‘some’, ‘all’, etc. occur
in the statement in such a way that the statement is true independently
of its other ingredients. Thus, consider the classical
example: 

(1) If every man is mortal and Socrates is a man then Socrates 
is mortal. 

Not only is this statement true, but it is true independently of the
constituents ‘man’, ‘mortal’, and ‘Socrates’; no alteration of
these words is capable of turning the statement into a falsehood. 
Any other statement of the form: 

(2) If every — is — and — is a — then — is — 

is equally true, so long merely as the first and fourth blanks are 
filled alike, and the second and last, and the third and fifth. A 
still simpler logical truth is: 

(3) Socrates is mortal or Socrates is not mortal; 

alteration of ‘Socrates’ and ‘mortal’ is incapable of making the 
statement false. 
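
The claim that (1) and (3) stay true under every alteration of their non-logical words can be spot-checked by brute force. What follows is a minimal sketch in Python, offered as an illustration only: it models ‘man’ and ‘mortal’ as arbitrary subsets F and G of a small finite domain and ‘Socrates’ as an arbitrary member a, and the names `form_2_holds`, `check_form_2`, and `check_form_3` are assumptions of this sketch. Checking small domains illustrates the point; it does not prove it.

```python
from itertools import product


def form_2_holds(domain, F, G, a):
    """Evaluate 'If every F is G and a is an F then a is G' in one interpretation."""
    every_F_is_G = all((x not in F) or (x in G) for x in domain)
    premiss = every_F_is_G and (a in F)
    return (not premiss) or (a in G)


def subsets(domain):
    """Yield every subset of the domain, each as a set."""
    for bits in product([False, True], repeat=len(domain)):
        yield {x for x, keep in zip(domain, bits) if keep}


def check_form_2(max_size=3):
    """Try every choice of F, G and a over domains of size 1..max_size."""
    for n in range(1, max_size + 1):
        domain = list(range(n))
        for F in subsets(domain):
            for G in subsets(domain):
                for a in domain:
                    if not form_2_holds(domain, F, G, a):
                        return False
    return True


def check_form_3():
    """'a is G or a is not G' under both truth values of 'a is G'."""
    return all(p or (not p) for p in [False, True])


print(check_form_2(), check_form_3())
```

No interpretation in the searched range falsifies either form, which is what it means for the altered words to have no bearing on the truth of the statement.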

A word may be said to occur essentially in a statement if replacement
of the word by another is capable of turning the statement
into a falsehood. 1 When this is not the case, the word may be said
to occur vacuously. Thus the words ‘Socrates’ and ‘man’ occur
essentially in the statement ‘Socrates is a man’, since the statements
‘Bucephalus is a man’ and ‘Socrates is a horse’ are false;
on the other hand ‘Socrates’ and ‘mortal’ occur vacuously in (3), 
and ‘Socrates’, ‘man’, and ‘mortal’ occur vacuously in (1). 
The logical truths, then, are describable as those truths in which 
only the basic particles alluded to earlier occur essentially. 

Those particles may be said to constitute the logical vocabulary. 
They are basic to all discourse. If we were to undertake e.g. to 
specify a geological vocabulary, comprising the words which occur 
essentially in the truths of geology, we should have to include in it 
not only such words as ‘moraine’, ‘fault’, etc., but also the whole 
of the logical vocabulary; and similarly for any other discipline. 
Accordingly the truths of logic may be reckoned trivially among 
the truths of geology, and among the truths of economics, and so 
on. This lends some sense to the dictum that logic has a universal 
subject matter, and is the common denominator of the special 
sciences. 

The words comprising the logical vocabulary can be substantially 
reduced, for we can paraphrase some of them with help of the rest.
The sense of ‘unless’, e.g., is adequately conveyed by ‘or’; again
the joint use of ‘neither-nor’ and ‘not’, in the fashion ‘neither
not . . . nor not . . .’, provides an adequate paraphrase of ‘and’.
When this sort of reduction is carried to the limit, the logical
vocabulary comes to comprise only ‘is’, ‘neither-nor’, and a device
corresponding roughly to the word ‘every’, together with a certain
scheme of pronouns auxiliary to the latter device. (Cf. §§ 9, 12,
22-23.)

1 For a somewhat more careful formulation see my essay “Truth by
Convention,” pp. 93 ff.
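
The paraphrase of ‘and’ by ‘neither not . . . nor not . . .’ can be verified by running through the truth values. The following is a minimal sketch in Python, assuming the ordinary two-valued reading of the connectives; the function names `nor`, `neg`, `conj`, `disj`, and `agrees` are illustrative choices of this sketch and not notation from the text.

```python
from itertools import product


def nor(p, q):
    """Joint denial, 'neither p nor q': true only when both components are false."""
    return not (p or q)


def neg(p):
    """Denial obtained from joint denial alone: 'neither p nor p'."""
    return nor(p, p)


def conj(p, q):
    """'and' paraphrased as 'neither not p nor not q'."""
    return nor(neg(p), neg(q))


def disj(p, q):
    """'or' (and likewise 'unless') paraphrased as the denial of 'neither p nor q'."""
    return neg(nor(p, q))


def agrees(derived, reference, arity):
    """Compare two truth functions on every assignment of truth values."""
    rows = product([False, True], repeat=arity)
    return all(derived(*row) == reference(*row) for row in rows)


checks = {
    "not": agrees(neg, lambda p: not p, 1),
    "and": agrees(conj, lambda p, q: p and q, 2),
    "or": agrees(disj, lambda p, q: p or q, 2),
}
print(checks)
```

Each derived connective is built from `nor` alone, so the comparison shows in miniature how the truth-functional part of the vocabulary shrinks to a single connective.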

What is ordinarily classified under mathematical logic or traditional
formal logic includes, indeed, not only logical truths in the
proposed sense — truths involving just the logical vocabulary 
essentially — but also statements about such truths. It is customary
to include within logic not only such statements as (1) and (3),
but also such statements as this: 

(4) Any statement of the form (2) is true so long as the first and 
fourth blanks are filled alike, and the second and last, and 
the third and fifth. 

The present book, in particular, consists in large part of such statements
as (4). We must thus distinguish two senses of logic, a
broader and a narrower; logic in the narrower sense comprises 
those truths which contain only the so-called logical vocabulary 
essentially, while logic in the broader sense includes both logic in 
the narrower sense and discourse about it. Discourse of the latter 
kind is classifiable, in large part at least, under the head of formal 
syntax (cf. Ch. VII). Over the years the term ‘logic’ has of course 
been applied also to a vast range of other topics, encroaching upon 
rhetoric, psychology, epistemology, metaphysics; but I shall not 
attempt to find a unifying principle among these far-flung applications
of the term.

Preparatory to the foregoing characterization of logical truths, 
we took the general notion of truth for granted. We simply distinguished
logical truths from others, by specifying what words
occur essentially in the logical truths. Now the general notion of
truth, central as it is to baffling problems of philosophy, may appear
rather too big a thing to take for granted. Repudiation of
“truth with a capital ‘T’” is a favorite way, indeed, of professing 
alignment with the hard-headed. But in point of fact there is no
denying that we know what it means to say that a given statement 
is true — absolutely True — just as clearly as we understand the 
given statement itself. The circumstances under which the
statement:

(5) Jones smokes 

would be said to be true, e.g., are precisely the circumstances under 
which Jones himself would be said to smoke. Truth of the statement
(5) is no more mysterious than the notions of Jones and
smoking. Applied to any statement S, the word ‘true’ is no obscurer
than the obscurest word in the statement S itself; for to
say that S is true is simply to say S. 1 This would seem to condone 
the present expository recourse to the general notion of truth, 
however the subtle problems which that notion involves may fare. 

To determine the truth of the statement (5), it is not enough to 
inspect the statement; we must also observe Jones. With most 
other statements the case is similar. Thus when we say that it is 
the business of the geologist to find out what statements are 
geological truths, we do not commit him to a sedentary study of 
statements; we call upon him to spend a good part of his time 
inspecting bluffs and craters. But logic, indeed mathematics
generally, is otherwise. The truth of (1) and (3), e.g., is recognizable
on inspection merely of the statements themselves. Confronted
with any logical truth or indeed any true statement of
mathematics, no matter how complex, we recognize its truth if at 
all merely by inspecting the statement and reflecting or calculating; 
observation of craters, test tubes, or human behavior is of no avail. 
Insofar then as logical truth is discernible at all, standards of logical 
truth can be formulated in terms merely of more or less complex 
notational features of statements; and correspondingly for 
mathematics generally. Thus it is that the logician and mathematician
talk about statements so much more than the geologist
does; and thus it is that logic in a broad sense is commonly taken
to include discourse about the statements of logic in the narrower
sense, whereas no such bifurcation is ordinarily imposed upon
geology.

1 For an instructive elaboration of this theme see §1 of Tarski’s
“Wahrheitsbegriff.”

If standards of logical or mathematical truth are to be formulated 
in terms merely of the observable features of statements, a first 
important step is revision and schematization of the language in 
such fashion as to put the relevant features of statements into the 
simplest possible form. Revision in this direction has long since 
taken place in mathematics; perhaps the most basic departures 
from ordinary language are the use of parentheses to indicate 
grouping, and the use of variables for those purposes of cross-reference
for which ordinary language uses pronouns (cf. § 12).
Through emulation of mathematics, a similar revision took place 
in logic from Boole onward; this is indeed so obvious a point of 
contrast between the old logic and the new that the latter is often 
called “symbolic logic”. But far more radical linguistic revisions 
are wanted if such projects as the formulation of truth criteria are 
to be expedited to the fullest. It is helpful to reduce the notions of 
logic and mathematics to a minimum, by defining some in terms of 
others; for this reduces the variety of statements which the truth 
criterion must cover. Reduction of this kind can, it turns out, be 
carried to great lengths. Mention has already been made of the 
meager array of devices which suffices for logic: analogues of 
‘is’, ‘neither-nor’, ‘every’, and pronouns. What is more surprising,
these devices prove adequate not only to logic but to pure
mathematics generally (cf. §§ 23, 52); mathematics reduces to logic.

Having given the language of logic the most economical and 
schematic form we can, we might next hope to devise some routine 
test which, applied merely to the notational patterns of statements, 
will distinguish always between those which are logically true and 
those which are not. But this is a good deal to hope, particularly 
in view of the reducibility of mathematics in general to logic. 
Every mathematical problem would become soluble by a mechanical
procedure — even the celebrated Fermat problem, which has
resisted solution for three centuries. Publication of proofs in 
mathematics would never again be necessary; results would 
simply be stated subject to mechanical check on the part of the 
reader.

Diffident of so bold a project, we might try to formulate some
less powerful notational criterion of logical and mathematical 
truth: a criterion whose fulfillment by any given statement is 
discernible only by luck rather than by an infallible routine test. 
Such, indeed, is the character of mathematical proof; a proof once 
discovered can be mechanically checked, but the actual discovery 
of the proof is a hit and miss matter. (Cf. §§ 16, 55.) Our present 
more modest objective, then, can take the form of an explicit 
formulation of the notion of proof, or theorem, such as will involve 
reference only to the notational patterns of statements. But 
recent developments indicate, actually, that an exhaustive criterion 
of logical or mathematical truth along even these more modest lines 
is impossible. Given any rules of proof which do not actually lead 
to falsehoods, there will be mathematical truths which cannot be 
proved by the rules (cf. § 60). There will always be some demonstrably
indemonstrable mathematical truths. We must content
ourselves with a version of “theorem” which covers only one or 
another important portion of the truths of logic and mathematics. 
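
The contrast between checking a proof and discovering one can be made concrete. Below is a minimal sketch in Python of a mechanical check that refers only to notational patterns: a line is accepted if it is one of the given axioms or follows from two earlier lines by modus ponens. The encoding of a conditional as the tuple `("if", antecedent, consequent)`, the function name `check_proof`, and the sample axioms are all assumptions made for illustration and are not the system of this book.

```python
def check_proof(lines, axioms):
    """Return True if every line is an axiom or follows by modus ponens.

    A conditional is encoded as the tuple ("if", antecedent, consequent);
    any other hashable value stands for an unanalyzed statement.
    """
    accepted = []
    for formula in lines:
        by_axiom = formula in axioms
        # Modus ponens: some earlier line A and an earlier line ("if", A, formula).
        by_modus_ponens = any(("if", a, formula) in accepted for a in accepted)
        if not (by_axiom or by_modus_ponens):
            return False
        accepted.append(formula)
    return True


# Hypothetical axioms, chosen only to exercise the checker.
axioms = {"A", ("if", "A", "B"), ("if", "B", "C")}

good = ["A", ("if", "A", "B"), "B", ("if", "B", "C"), "C"]
bad = ["C"]

print(check_proof(good, axioms), check_proof(bad, axioms))
```

The check runs through the lines once and never fails to terminate with a verdict; nothing in it says how the sequence `good` was to be found, which is the hit and miss part.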

The fact remains that insofar as logical or mathematical truth 
can be detected at all, the standards can be formulated explicitly 
as criteria hinging upon the notational patterns of statements. 
Such truths of logic and mathematics as are not obvious are 
recognized, if at all, by proof; and the considerations which go to 
make up a proof do admit of explicit formulation in terms of 
notational features of statements. Such formulation is not only 
possible, but highly desirable for the increased insight which it 
promises. The very discovery that there must always be indemonstrable
mathematical truths is a discovery which could not have
been made without analyzing the notion of “proof” or “theorem” 
in terms explicitly of notational features of statements. 

The described developments are by no means the whole motivation
of the modern refinements in logic. The reduction of the
notions of mathematics and logic to a minimum, for example, is 
prompted by much more than the special purpose mentioned 
earlier. It is indeed an end in itself, for it reveals just what few 
notions are finally presupposed by the whole of mathematics. 
Through this process of paraphrasing some notions in terms of 
others, moreover, and finally paraphrasing all in terms of the
chosen few, the several mathematical and logical notions of course 
undergo illuminating analysis. 

Schematization of the logical language is of significance also in 
more practical spheres. Logic has its practical use in inference 
from premisses which are not logical truths to conclusions which 
are not logical truths. Logic countenances such inference when 
the conditional statement ‘If . . . then . . . ’ connecting premiss 
with conclusion is itself logically true (like (1) above); and it is in 
this way that logical truth links up with extra-logical concerns. 
Precisely the analogous account holds with regard to applications 
of mathematics generally; the tremendous utility of mathematical 
techniques in natural science turns simply on the importance of 
discerning mathematical truths of the form ‘If . . . then . . .’
whose component parts are statements of natural science. Unlike 
the numerical branches of mathematics, however, logic itself has 
traditionally figured in natural science only tacitly and at a pretty 
rudimentary level of inference; it has played fully as subordinate 
a role as arithmetic might have played in the days of Roman 
numerals. But when we schematize logic along the lines of modern 
mathematics (retaining convenient abbreviations, of course, over 
and above the minimum logical vocabulary), we have a tool which 
is about as effective as arithmetic and derivative branches of 
mathematics — and which far surpasses these latter in scope of 
applicability. 

Where number is irrelevant, regimented mathematical technique 
has hitherto tended to be lacking. Thus it is that the progress of 
natural science has depended so largely upon the discernment of 
measurable quantity of one sort or another. Measurement con¬ 
sists in correlating our subject matter with the series of real num¬ 
bers; and such correlations are desirable because, once they are 
set up, all the well-worked theory of numerical mathematics lies 
ready at hand as a tool for our further reasoning. But no science 
can rest entirely on measurement, and many scientific investiga¬ 
tions are quite out of reach of that device. To the scientist longing 
for non-quantitative techniques, then, mathematical logic brings 
hope. It provides explicit techniques for manipulating the most 
basic ingredients of discourse. Its yield for science may be expected
to consist also in a contribution of rigor and clarity, a
sharpening of the concepts of science. Such sharpening of con¬ 
cepts should serve both to disclose hitherto hidden consequences 
of given scientific hypotheses, and to obviate subtle errors which 
may stand in the way of scientific progress. 

Mathematical logic has been applied, 1 but the most important 
applications are surely still to come. The usefulness of a theory is 
not to be measured solely in terms of the application of prefabri¬ 
cated techniques to preformulated problems; we must allow the 
applicational needs themselves, rather, to play their part in 
motivating further elaborations of theory. The history of mathe¬ 
matics has consisted to an important degree in such give and take 
between theory and application. Much of the promise of mathe¬ 
matical logic for science lies in its potentialities as a basis from 
which to construct subsidiary techniques of unforeseen kinds in 
response to special needs. 

1 See e.g. Berkeley, Carnap (Aufbau), Dittrich, Hempel and Oppenheim, Hull,
Quine (“Relations and Reason”), Russell (“Order in Time”), Shannon, Stamm, 
Whitehead (“Mathematical Concepts,” “Theorie relationniste”), Wiener (“Syn¬ 
thetic Logic,” “Measurement”), Woodger, Zwicky. 
STATEMENTS




§ 1. Conjunction, Alternation, and Denial


MATHEMATICAL logic at its most elementary level deals with 
statements, or declarative sentences, and with ways of compound¬ 
ing them into further statements. One of these modes of com¬ 
position, known to logicians as conjunction, consists in joining two 
statements by ‘and’ —or, in the notation of mathematical logic, 
by the dot ‘. ’. ‘Some are born great and some achieve great¬ 
ness’, or ‘some are born great . some achieve greatness’, is the 
conjunction of ‘some are born great’ and ‘some achieve great¬ 
ness’. A conjunction is true if both component statements are 
true; otherwise false. To determine the truth value (truth or 
falsehood) of a conjunction, therefore, it is sufficient to know the 
truth values of the components. 

A mode of statement composition will be said to be truth- 
functional if the truth value of the compound is determined in all 
cases by the truth values of the components. Conjunction is thus 
truth-functional. So are all other modes of statement composition 
required for mathematical logic, or mathematics generally. There 
is reason to believe that none but truth-functional modes of 
statement composition are needed in any discourse, mathematical 
or otherwise; but this is a controversial question. (Cf. § 5.) 

Each such mode of composition is exhaustively describable by a 
truth table — a table specifying the truth value of the compound for 
each assignment of truth values to the components. Thus the 
table for conjunction is as follows: 


1st component    2d component    Conjunction

      T               T               T
      F               T               F
      T               F               F
      F               F               F


It tells us that the conjunction is true when both components are 
true, false when the first component is false and the second true, 
and false likewise in the two remaining cases. 
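
As an informal illustration, not part of the original text, the conjunction table can be reproduced mechanically. The sketch below is in Python, with `True` and `False` standing in for 'T' and 'F'. The names `ROWS`, `conjunction`, and `truth_table` are hypothetical choices made for this example, and the row order is assumed to follow the table above.

```python
# Rows in the order used by the table above: the first component
# alternates T, F; the second component alternates in pairs.
ROWS = [(True, True), (False, True), (True, False), (False, False)]


def conjunction(p: bool, q: bool) -> bool:
    """True only when both components are true."""
    return p and q


def truth_table(connective):
    """Return the compound's value for each row of ROWS, as 'T' or 'F'."""
    return ["T" if connective(p, q) else "F" for p, q in ROWS]


print(truth_table(conjunction))  # ['T', 'F', 'F', 'F']
```

Because `truth_table` consults nothing but the truth values of the components, any connective that can be passed to it is truth-functional in the sense defined above.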

A second mode of composition, no less familiar than conjunction,
is alternation — composition of statements by means of the con¬ 
nective ‘or’. Reflection upon ordinary usage of ‘or’ reveals that 
an alternation is true if just one of its components is true, and false 
if they are both false. This gives us three-quarters of a truth table
for alternation:

1st component    2d component    Alternation

      T               T
      F               T               T
      T               F               T
      F               F               F


But the top row is indicated less clearly by usage. We must de¬ 
cide whether ‘or’ is to be construed in an exclusive sense, corre¬ 
sponding to the Latin ‘aut’, or in an inclusive sense, corresponding 
to the Latin ‘vel’. When ‘or’ is used in the exclusive sense, the 
compound is regarded as true only if exactly one of the two com¬ 
ponents is true; joint truth of the components falsifies the com¬ 
pound. An ‘or’-compound in this sense can be expressed more 
clearly by adding the words ‘but not both’. When ‘or’ is used 
in the inclusive sense, on the other hand, the compound is regarded 
as true if at least one of the components is true; joint truth of the 
components verifies the compound. An ‘or’-compound in this 
sense can be expressed more clearly by adding the words ‘or 
both’, or by admitting the barbarism ‘and/or’. The blank in 
our table receives an ‘F’ or a ‘T’ according as ‘or’ is construed 
in the exclusive or the inclusive sense. 

Common usage can be adduced in support of either course. If a 
witness guesses that either the steering gear was loose or the 
driver was drunk, and it is found that the steering gear was loose 
and the driver was drunk, we do not regard the witness as mis¬ 
taken; to this extent usage supports the inclusive ‘or’. At the 
same time the prevalent use of the expressions ‘or both’ and 
‘and/or’ is a presumption in favor of the exclusive interpretation, 
since otherwise these expressions would always be superfluous. 

In mathematical logic the ambiguity of ordinary usage is resolved
by adopting a special symbol ‘v’, suggestive of ‘vel’, to
take the place of ‘or’ in the inclusive sense. Alternation is 
identified with this usage; thus the truth table of alternation 
becomes: 


1st component    2d component    Alternation

      T               T               T
      F               T               T
      T               F               T
      F               F               F


The exclusive use of ‘or’ is not frequent enough in technical 
developments to warrant a special name and symbol. 
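
The difference between the two senses of ‘or’ can be checked mechanically. The following Python sketch is an illustration only; the function names `inclusive_or` and `exclusive_or` are hypothetical, and `True`/`False` stand in for 'T'/'F'. It lists the rows on which the two readings disagree.

```python
# The four assignments, in the row order of the tables above.
ROWS = [(True, True), (False, True), (True, False), (False, False)]


def inclusive_or(p: bool, q: bool) -> bool:
    """'vel': true if at least one component is true."""
    return p or q


def exclusive_or(p: bool, q: bool) -> bool:
    """'aut': true if exactly one component is true."""
    return p != q


# Collect the rows on which the two readings give different values.
differing = [row for row in ROWS if inclusive_or(*row) != exclusive_or(*row)]
print(differing)  # [(True, True)]
```

Only the top row, where both components are true, separates the two readings, which is why usage settles the other three rows and leaves that one open.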

The connectives ‘and’ and ‘or’ are used in ordinary discourse 
not only between statements but between nouns, verbs, preposi¬ 
tions, indeed nearly all parts of speech. It is only as connectives of 
statements, however, that they receive the symbolic rendering 
‘ . ’ and ‘v’. The other uses do not fall under the head of con¬ 
junction and alternation. 

Conjunction and alternation are binary, in that they combine 
statements two at a time. But denial, to which we now turn, is 
singulary: 1 it is a method merely of elaborating a single statement
to form a new statement. It has the following truth table,
which is limited to two rows because there is just one compo¬ 
nent. 

Component Denial 

T F 

F T 

The denial is true if and only if the original statement is false. 

The method of forming the denial in ordinary language is irreg¬ 
ular. Sometimes ‘not’ is attached to the main verb; thus the 
denial of ‘Jones is away’ is ‘Jones is not away’. But if the verb 

1 The series of adjectives ‘binary’, ‘ternary’, ‘quaternary’, ‘quinary’, . . .
leaves mathematicians in a quandary when n = 1. It is customary to stammer 
out some such makeshift as ‘unary’ or ‘uninary’ or ‘unitary’. But the proper 
word is apparent if we reflect that the series of Latin distributives ‘bini’, ‘terni’, 
‘quaterni’, ‘quini’, . . . begins with ‘singuli’. 
is governed by ‘sometimes’ or ‘always’, the denial is formed
rather by substituting ‘never’ or ‘not always’. If the statement 
is a conjunction or alternation and hence has no main verb, then 
denial is accomplished by one or another periphrasis. 

In mathematical logic the denial of a statement is formed by 
prefixing the tilde ‘~’, which is a modified ‘n’ and is conveniently
read ‘not’. Thus ‘Jones is not away’ gives way to
‘~ Jones is away’. Over the ordinary use of ‘not’ this notation
has two advantages: it leaves the internal constitution of the 
affected statement intact, and it applies immediately to state¬ 
ments of any form and complexity. 

The first branch of mathematical logic to reach maturity was the so-called algebra 
of logic, which was founded by Boole (1847) and improved by Jevons, Peirce, and 
Schröder. This algebra, while regarded primarily as an algebra of classes (cf. § 33),
was recognized as amenable to reinterpretation as a theory governing conjunction, 
alternation, and denial. Truth tables, and the graphic method of calculation which 
they provide (cf. § 3), came only in 1920-21 (Łukasiewicz, Post, Wittgenstein);
but much the same technique in non-tabular form was made known by Peirce 
(3.387 f) in 1885, and is indeed implicit in Boole’s “general rule of development” 
(Laws of Thought, p. 75 f). The signs ‘v’, ‘~’, and ‘ . ’ are from Whitehead and
Russell.


§ 2. The Conditional

ANOTHER binary connective, and a particularly important one, 
is ‘if-then’. A statement thus compounded is known as a con¬ 
ditional (or hypothetical). The first of its two component state¬ 
ments, placed between ‘if’ and ‘then’, is called the antecedent (or 
protasis or hypothesis) of the conditional; the second component, 
placed after ‘then’, is called the consequent (or apodosis or
thesis). 

Any conditional with true antecedent and false consequent is 
false. The conditional ‘If Jones was here then the glove is his’ 
is clearly disproved when it turns out that Jones was here and the 
glove is not his. But none of the other three combinations of truth 
values is adequate to disproving the conditional. In case Jones 
was not here, the conditional stipulates nothing regarding the 
ownership of the glove; and in case Jones was here and the glove
is his, the conditional is supported. Are we then to construe a 
conditional as false in the one case and true in the other three? 


Antecedent    Consequent    Conditional

    T             T              T
    F             T              T
    T             F              F
    F             F              T


The mode of composition described in the table constitutes the 
nearest truth-functional approximation to the conditional of 
ordinary discourse, and may be called the truth-functional con¬ 
ditional. All truth-functional conditionals with false antecedents 
and all with true consequents are true. Only those are false which 
have true antecedents and false consequents. If we construe 
‘if-then’ in this sense, the following become true: 

(1) If France is in Europe then the sea is salt, 

(2) If France is in Australia then the sea is salt, 

(3) If France is in Australia then the sea is sweet. 

Symbolically the truth-functional conditional is rendered by the
connective ‘⊃’, thus:

(4) Jones was here ⊃ the glove is his.

(4) is true in case Jones was not here, true also in case the glove is 
his, and false just in case he was here and the glove is not his. It is 
true whenever at least one of the two statements ‘~ Jones was
here’ and ‘the glove is his’ is true, and false in the case where
‘Jones was here’ and ‘~the glove is his’ are both true. (4) can 
hence be paraphrased in terms of denial and alternation thus: 

(5) ~ Jones was here v the glove is his,
or in terms of denial and conjunction thus: 

(6) ~ (Jones was here . ~ the glove is his). 

The parentheses in (6) indicate the grouping, as usual in mathe¬ 
matics. They indicate that the denial sign applies to the whole 
conjunction. When there is no such indication, a denial sign 
always applies to the shortest statement following it; thus (5) is
understood as: 

(~ Jones was here) v the glove is his 

rather than: 

~ (Jones was here v the glove is his).
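
The paraphrases (5) and (6), and the point about grouping, can be checked by brute force over the four assignments. The Python sketch below is an illustration under stated assumptions: `CONDITIONAL_TABLE` simply transcribes the table for the truth-functional conditional given earlier in this section, and the function names are hypothetical.

```python
from itertools import product

# The conditional table of this section, keyed by (antecedent, consequent).
CONDITIONAL_TABLE = {
    (True, True): True,
    (False, True): True,
    (True, False): False,
    (False, False): True,
}


def via_alternation(p: bool, q: bool) -> bool:
    """Paraphrase (5): (~p) v q."""
    return (not p) or q


def via_conjunction(p: bool, q: bool) -> bool:
    """Paraphrase (6): ~(p . ~q)."""
    return not (p and not q)


def misgrouped(p: bool, q: bool) -> bool:
    """The rejected reading of (5): ~(p v q)."""
    return not (p or q)


cases = list(product([True, False], repeat=2))

# (5) and (6) agree with the table in every case.
paraphrases_agree = all(
    CONDITIONAL_TABLE[(p, q)] == via_alternation(p, q) == via_conjunction(p, q)
    for p, q in cases
)

# The misgrouped reading differs from (5) in at least one case.
grouping_matters = any(via_alternation(p, q) != misgrouped(p, q) for p, q in cases)

print(paraphrases_agree, grouping_matters)  # True True
```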

A statement of the form ‘--- ⊃ ---’ is conveniently pronounced
in the usual fashion ‘If --- then ---’; 1 but we have now
to consider to what extent this conforms to the ordinary usage of 
‘if-then’. Let us examine the latter more closely. The subjunctive 
mood is ordinarily used in a conditional when the speaker believes 
the antecedent to be false, and the indicative mood when he wants 
to leave the antecedent unprejudiced. Usage of the subjunctive 
conditional is certainly at variance with the above table; for if all 
conditionals with false antecedents are true, no subjunctive con¬ 
ditional is worth affirming. The subjunctive conditional is in fact 
not directly identifiable with any truth-functional mode of composition,
but calls for a more elaborate analysis. (Cf. § 5.) This
task is important for the philosophy of science, for the subjunctive 
conditional is implicit in those terms of disposition or potentiality 
with which the natural sciences abound: ‘soluble’, ‘malleable’, 
‘hard’, ‘sensitive’, ‘intelligent’, etc. The task is not one, however,
which need be accomplished within the framework of mathemat¬ 
ical logic. Mathematics makes no use of the subjunctive condi¬ 
tional; the indicative form suffices. It is hence useful to shelve 
the former and treat the ‘if-then’ of the indicative conditional as a 
connective distinct from ‘if-then’ subjunctively used. It remains 
to consider whether the above table gives an acceptable version of 
the indicative conditional. 2 

1 The reading ‘implies’, for ‘⊃’, is to be avoided (cf. § 5). Its popularity
probably arises from the fact that the ‘if-then’ reading calls for an inconvenient
excursion into the context; ‘then’ supplants ‘⊃’, but ‘if’ has to be inserted in an
earlier position. Note however that we can avoid this inconvenience by reading
‘⊃’, at will, simply as ‘only if’. A little reflection shows that ‘If --- then ---’ can
always be given the alternative rendering ‘--- only if ---’, without disturbing the
order of the components. Whereas ‘--- if ---’ is of course the reverse of ‘If ---
then ---’, insertion of ‘only’ has the peculiar effect of restoring the normal order of
antecedent and consequent.

2 Not only the present section but also § 5 is relevant to the controversy which
has arisen over this point. Those acquainted with the controversy should keep in
mind that in the present section nothing is said of implication.

One who affirms an indicative conditional is ordinarily uncertain
as to the truth values of both antecedent and consequent. He
believes it is not the case that the first is true and the second false,
but ordinarily he does not know whether they are both true, or
both false, or first false and second true. It is for this reason,
perhaps, that the truth of (1)-(3) seems strange; not because these
would be false under ordinary usage, but because they would be
idle or senseless. Indeed, since usage conforms to the third line of
the table, and usage lapses as soon as a case is precisely located
elsewhere in the table, there is no clear conflict between the table
and the indicative conditional of ordinary usage. The table
departs from usage only in committing itself on points where usage
is lacking. This much departure is necessary in any complete
formulation, that is, any formulation which explains the conditional
for every pair of statements.

What the truth table adds, in thus deciding the cases beyond the
range of ordinary usage, is essentially theoretical; no supplementary
practical use of ‘if-then’ is thereby prescribed. In practice,
even in the light of the truth table, one would naturally not bother
to affirm a conditional if he were in position to affirm the consequent
outright or to deny the antecedent, any more than one
bothers to affirm an alternation when he knows which component
is true. We say:

(7) If Jones has malaria then he needs quinine

because we know about malaria but are in doubt both of Jones’s
ailment and of his need of quinine. Thus only those conditionals
are worth affirming which follow from some manner of relevance
between antecedent and consequent, some manner of law, perhaps,
connecting the matters which these two component statements
describe. Such connection underlies the useful application
of the truth-functional conditional without participating in the
meaning of that notion. The situation is similar, indeed, in the
case of alternation.

A conditional thus inferred from a general law must be distinguished
from the law itself, which commonly bears the outward
form of a conditional in turn: 

(8) If one has malaria then he needs quinine. 

Certainly (8) asserts a connection between malaria and quinine 
which transcends any truth table; but actually the statement is 
elliptical, and analysis deprives it of the form of a pure conditional. 
It is short for some such statement as this: 

(9) Whatever person be selected, if he has malaria then he needs 

quinine. 

This statement is of an important kind which will be taken up in 
Chapter II under the head of quantification. Its initial part con¬ 
fers universality; the remainder has no separate status as a state¬ 
ment at all, for it contains pronouns which refer back to the initial 
part and lose all meaning on isolation. (9) is not itself a condi¬ 
tional, but has the effect of simultaneous affirmation of a vast 
array of conditionals. Each of these separate conditionals, e.g. 
(7), can still be construed in accordance with the truth table; the 
strong interconnection between malaria and quinine which (9) 
conveys is not to be laid to the conditional mode of composition 
but to the initial indication of universality. 
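
The division of labor between the universal prefix and the conditional can be sketched in code. The Python example below is only a finite illustration: the class `Person`, the list `people`, and its entries are invented data, not anything given in the text, and a real quantification ranges over all persons rather than a short list. Each person contributes one truth-functional conditional like (7), and the universality of (9) is carried by `all`.

```python
from dataclasses import dataclass


@dataclass
class Person:
    name: str
    has_malaria: bool
    needs_quinine: bool


def conditional(p: bool, q: bool) -> bool:
    """Truth-functional conditional: false only for (true, false)."""
    return (not p) or q


# Hypothetical sample data, chosen only to exercise the three true rows.
people = [
    Person("Jones", has_malaria=True, needs_quinine=True),
    Person("Smith", has_malaria=False, needs_quinine=False),
    Person("Robinson", has_malaria=False, needs_quinine=True),
]

# (9) over this finite list: every separate conditional must be true.
law_holds = all(conditional(x.has_malaria, x.needs_quinine) for x in people)
print(law_holds)  # True

# A single person with malaria and no need of quinine refutes the law.
counterexample = Person("Brown", has_malaria=True, needs_quinine=False)
law_with_counterexample = all(
    conditional(x.has_malaria, x.needs_quinine) for x in people + [counterexample]
)
print(law_with_counterexample)  # False
```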


The truth-functional version of the conditional goes back to Philo of Megara. 
It was revived in modern logic by Frege (1879) and Peirce (1885). The conditional 
sign ‘⊃’ was used by Gergonne as early as 1816, though not in the truth-functional
sense. The appropriateness of the truth-functional version was vigorously debated
in ancient times (cf. Peirce, 3.441 ff; Łukasiewicz, “Zur Geschichte”, p. 116), and
has become a current topic of controversy as well. The issue has been clouded,
however, by failure to distinguish clearly between the conditional and implication
(cf. § 5).


§ 3. Iterated Composition


WHEN truth-functional composition is applied in iteration to 
form a complex statement, e.g.: 

(1) Germany will withdraw v (France will invade . England will
mobilize),

(2) (Jones was here ⊃ the glove is his) . (the glove is his ⊃ Jones
was here),

(3) (Jones came . ~ (Smith stayed . ~ Robinson left)) v Robinson
left,

we can always calculate the net import of the result by truth 
tables. We can determine systematically what truth values on the 
part of the ultimate components would make the whole true and 
what ones would make the whole false. 

This is accomplished, for example in the case of (2), by assigning 
all possible combinations of truth values to the ultimate com¬ 
ponents ‘Jones was here’ and ‘the glove is his’ and then deriving 
the truth value of (2) for each of these assignments by consulting 
the truth tables of the modes of composition involved. The process 
can conveniently be carried out by a tabular construction whose
outcome is Table 1.

(Jones was here  ⊃  the glove is his)  .  (the glove is his  ⊃  Jones was here)
       T         T         T           T          T          T        T
       F         T         T           F          T          F        F
       T         F         F           F          F          T        T
       F         T         F           T          F          T        F

                                   TABLE 1

The first and third columns, standing beneath


‘Jones was here’ and ‘the glove is his’, exhaust the four combina¬ 
tions of truth values; they are the familiar columns used in earlier 
tables. These columns are repeated under the repeated occurrences 
of ‘Jones was here’ and ‘the glove is his’. Each of the three re¬ 
maining columns stands beneath a statement connective, and lists 
the truth values, for the four cases, of the compound produced by 
that connective. Thus, the second column indicates the truth 
value of the conditional: 

Jones was here ⊃ the glove is his

for the successive cases; the four entries in the column were found 
by inspecting the two adjacent columns and taking the appropriate 
values for the conditional according to the conditional table (§ 2). 
The next to last column, indicating the value of: 
The glove is his ⊃ Jones was here

for the successive cases, was obtained similarly from its adjacent 
columns. Finally, the middle column indicates the value of the 
whole conjunction (2) for the successive cases; the four entries 
here were found by inspecting the columns belonging to the two 
immediate components of the conjunction, namely the two con¬ 
ditionals, and taking the appropriate values for the whole accord¬ 
ing to the conjunction table (§ 1). 

The completed Table 1 tells us, then, in its middle column, that 
(2) is true in just the first and last cases: true if Jones was here and 
the glove is his, true if Jones was not here and the glove is not his, 
and false otherwise. 
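
The column-by-column construction just described can be mirrored in a short program. The Python sketch below is an illustration only; the names `conditional`, `mark`, `CASES`, and `table_1` are hypothetical, and the order of cases is assumed to be that of the earlier tables. Each row is built in the order of the columns of Table 1.

```python
def conditional(p: bool, q: bool) -> bool:
    """Truth-functional conditional, per the table of § 2."""
    return (not p) or q


def mark(value: bool) -> str:
    """Render a truth value as 'T' or 'F'."""
    return "T" if value else "F"


# The four cases (Jones was here, the glove is his), in table order.
CASES = [(True, True), (False, True), (True, False), (False, False)]

table_1 = []
for jones_here, glove_his in CASES:
    left = conditional(jones_here, glove_his)    # second column
    right = conditional(glove_his, jones_here)   # next to last column
    whole = left and right                       # middle column: statement (2)
    row = [jones_here, left, glove_his, whole, glove_his, right, jones_here]
    table_1.append(" ".join(mark(v) for v in row))

for line in table_1:
    print(line)
```

The fourth entry of each printed row is the value of (2) as a whole, and it is 'T' in just the first and last rows.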

The form of iterated composition illustrated in (2) is, as it hap¬ 
pens, a particularly important one; for it corresponds to the idiom 
‘if and only if’. Symbolically it is usually rendered in abbreviated
fashion by the connective ‘≡’; thus (2) becomes:

Jones was here ≡ the glove is his.

This binary mode of composition may be called the biconditional, 
more specifically the truth-functional biconditional, for it is the 
conjunction of opposite truth-functional conditionals. It has 
Table 2 as its truth table, for we saw that (2) is true just in case
the components are both true or both false.

1st component    2d component    Biconditional

      T               T               T
      F               T               F
      T               F               F
      F               F               T

                  TABLE 2

The biconditional is
true just in case its components are alike in truth value; false 
just in case one component is true and the other false. This mode 
of composition is related to the ‘if and only if’ of ordinary usage 
precisely as the truth-functional conditional is related to the 
ordinary usage of ‘if-then’ (cf. § 2). 
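
That the conjunction of opposite conditionals is true just when the components are alike in truth value can be confirmed by exhausting the cases. The Python sketch below is illustrative; the function names are hypothetical, and sameness of truth value is expressed with Python's `==` on booleans.

```python
from itertools import product


def conditional(p: bool, q: bool) -> bool:
    """Truth-functional conditional."""
    return (not p) or q


def biconditional(p: bool, q: bool) -> bool:
    """Conjunction of the two opposite conditionals, as in (2)."""
    return conditional(p, q) and conditional(q, p)


# Compare with plain sameness of truth value in all four cases.
same_as_equality = all(
    biconditional(p, q) == (p == q) for p, q in product([True, False], repeat=2)
)
print(same_as_equality)  # True
```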
1st component    2d component    3d component

      T               T               T
      F               T               T
      T               F               T
      F               F               T
      T               T               F
      F               T               F
      T               F               F
      F               F               F

                  TABLE 3



Let us now analyze (3). Here there are three ultimate com¬ 
ponents, ‘Jones came’, ‘Smith stayed’, and ‘Robinson left’, in¬ 
stead of just two. Our first task, then, is to exhaust the ways of 
assigning truth values to three components. A convenient sys¬ 
tematic method is shown in Table 3. The upper half of this array 
was formed by setting down the familiar array for two components 
and adjoining a third column composed uniformly of ‘T’; the 
lower half was formed similarly using ‘F’. The array for four 
components would be formed from this array for three components 
by the same method, and so on. Obviously this scheme always 
exhausts the combinations. In general, we see that the array for 
n components will have 2ⁿ rows; and that ‘T’ and ‘F’ will always
be alternated simply in the first column, in pairs in the second 
column, in fours in the third column, and so on. 
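
The recursive recipe for the array (take the array for one fewer component, adjoin a column of ‘T’, then repeat with a column of ‘F’) translates directly into code. The Python sketch below is an illustration; the function name `assignments` is hypothetical, and `True`/`False` stand in for ‘T’/‘F’.

```python
def assignments(n: int):
    """Rows of truth values for n components, in the order of Table 3."""
    if n == 0:
        return [()]  # one empty row: nothing to assign
    smaller = assignments(n - 1)
    # Upper half: the array for n-1 components with a final column of T;
    # lower half: the same array with a final column of F.
    upper = [row + (True,) for row in smaller]
    lower = [row + (False,) for row in smaller]
    return upper + lower


rows = assignments(3)
print(len(rows))  # 8
for row in rows:
    print(" ".join("T" if v else "F" for v in row))
```

Since each step doubles the number of rows, the array for n components has 2ⁿ rows, and the first column alternates singly, the second in pairs, the third in fours.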

We have now to calculate the truth value of (3) for each of the 
eight ways of assigning truth values to the three ultimate com¬ 
ponents. The construction, analogous to that of Table 1, has 
Table 4 as its outcome. The columns under ‘Jones came’, ‘Smith 
stayed’, and ‘Robinson left’ are the columns of Table 3, with 
repetition for the repeated component ‘Robinson left’. The sixth 
column indicates the truth values of the denial ‘~ Robinson left’ 
for the eight cases; its entries are derived from those in the seventh 
column by means of the denial table. The fifth column indicates 
the values of: 

Smith stayed . ~Robinson left; 

it is derived from the adjacent columns by the conjunction table. 
(Jones came  .  ~ (Smith stayed  .  ~ Robinson left))  v  Robinson left
     T       T  T       T        F  F       T          T        T
     F       F  T       T        F  F       T          T        T
     T       T  T       F        F  F       T          T        T
     F       F  T       F        F  F       T          T        T
     T       F  F       T        T  T       F          F        F
     F       F  F       T        T  T       F          F        F
     T       T  T       F        F  T       F          T        F
     F       F  T       F        F  T       F          F        F

                               TABLE 4

The third column indicates the values of the denial of this con¬ 
junction; it is derived from the fifth column by the denial table. 
The second column indicates the values of: 

Jones came . ~ (Smith stayed . ~ Robinson left);
it is derived from the adjacent columns by the conjunction table. 
Finally, the eighth column indicates the values of the whole al¬ 
ternation (3); it is derived from the second and last columns by the 
alternation table. 

Table 4, thus constructed, tells us in its eighth column that (3) is 
true in all cases except the fifth, sixth, and last. Thus (3) is false if 
Jones came, Smith stayed, and Robinson did not leave; false also 
if Jones did not come, Smith stayed, and Robinson did not leave; 
false also if Jones did not come, Smith did not stay, and Robinson 
did not leave; but true in all other cases. 
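
The verdict read off Table 4 can be confirmed by evaluating (3) directly on the eight assignments. The Python sketch below is illustrative; the function name `statement_3` and the list `rows` are hypothetical, and the rows are generated in the order of Table 3 (first component alternating fastest).

```python
def statement_3(jones_came: bool, smith_stayed: bool, robinson_left: bool) -> bool:
    """(Jones came . ~(Smith stayed . ~Robinson left)) v Robinson left."""
    inner = smith_stayed and not robinson_left
    return (jones_came and not inner) or robinson_left


# Table 3 order: the last component changes slowest, the first fastest.
rows = [
    (j, s, r)
    for r in (True, False)
    for s in (True, False)
    for j in (True, False)
]

# Number the cases from 1, as the text does, and collect the false ones.
false_cases = [i for i, row in enumerate(rows, start=1) if not statement_3(*row)]
print(false_cases)  # [5, 6, 8]
```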

In Table 5 the method is applied to a case where there is just one 
ultimate component. Here there are just two rows, exhausting the 
truth possibilities for the single component. The third column 
shows that the whole is true if Jones was not here, otherwise false. 

~ Jones was here  ≡  (Jones was here  ⊃  Jones was here)
F       T         F         T         T        T
T       F         T         F         T        F

                        TABLE 5
§ 4. Use versus Mention

IN THE literature on the logic of statements, and in other founda¬ 
tional studies of mathematics as well, confusion and controversy 
have resulted from failure to distinguish clearly between an object 
and its name. Ordinarily the failure to maintain this distinction is 
not to be attributed to any close resemblance between the object 
and the name, even if the object happens to be a name in turn; for 
even the discrimination between one name and another is a visual 
operation of an elementary kind. The trouble comes rather in 
forgetting that a statement about an object must contain a name 
of the object rather than the object itself. If the object is a man or 
a city, physical circumstances prevent the error of using it instead 
of its name; when the object is a name or other expression in turn, 
however, the error is easily committed. 

As an illustration of the essential distinction, consider these 
three statements: 


(1) Boston is populous,

(2) Boston is disyllabic,

(3) ‘Boston’ is disyllabic.


The first two are incompatible, and indeed (1) is true and (2) false. 
Boston is a city rather than a word, and whereas a city may be 
populous, only a word is disyllabic. To say that the place-name in 
question is disyllabic we must use, not that name itself, but a name 
of it. The name of a name or other expression is commonly formed 
by putting the named expression in single quotation marks; the 
whole, called a quotation, denotes its interior. This device is used 
in (3), which, like (1), is true. (3) contains a name of the di¬ 
syllabic word in question, just as (1) contains a name of the 
populous city in question. (3) is about a word which (1) contains; 
and (1) is about no word at all, but a city. In (1) the place-name 
is used, and in this way the city is mentioned; in (3) a quotation is 
used, and the place-name is mentioned. We mention x by using a 
name of x; and a statement about x contains a name of x. 1

1 By these considerations, the first sentence of the present paragraph might be
criticized for failure to enclose the whole statements (1)-(3) in quotation marks.
But it is clearer to avoid quotation, in such cases, by agreeing to regard the colon
as equivalent to quotation when followed by displayed text (text centered in a new
line).

The foregoing treatment of (1)-(3) is itself replete with mention
of expressions, yet free from quotations. These were avoided by
circumlocution. As an exercise in quotation marks, however, it
may be useful now to add a few comments involving them.
‘Boston is populous’ is about Boston and contains ‘Boston’; ‘ ‘Boston’
is disyllabic’ is about ‘Boston’ and contains ‘ ‘Boston’ ’.
‘ ‘Boston’ ’ designates ‘Boston’, which in turn designates Boston. To
mention Boston we use ‘Boston’ or a synonym, and to mention
‘Boston’ we use ‘ ‘Boston’ ’ or a synonym. ‘ ‘Boston’ ’ contains
six letters and just one pair of quotation marks; ‘Boston’ contains
six letters and no quotation marks; and Boston contains some
800,000 people.

Such examples as (3), or:

(4) ‘Boston’ has six letters,

(5) ‘Boston’ is a noun,

(6) ‘Boston’ occurs in Walt Whitman’s Chants Democratic,

must not be thought of as exhausting the kinds of things that can
be said about an expression. These four statements ascribe properties
to ‘Boston’ which might be classed respectively as phonetic,
morphological, grammatical, and literary; roughly speaking, they
have nothing to do with meaning. But an expression which has
use in language will also have semantic properties, or properties
which arise from the meaning of the expression. Such properties
are ascribed to ‘Boston’ by these statements:

(7) ‘Boston’ designates Boston,

(8) ‘Boston’ designates a populous city,

(9) ‘Boston’ designates the capital of Massachusetts,

(10) ‘Boston’ is synonymous with ‘the capital of Massachusetts’.

(7)-(10) are just as genuinely statements about ‘Boston’ as are
(3)-(6), and omission of the quotation marks from (7)-(10)
would give results no less objectionable than (2); for it is only
expressions, not places, that can designate or be synonymous. A
statement about an expression may depend for its verification 
upon considerations of sound or shape or literary locale, or even 
upon considerations of population or other extra-linguistic matters 
of fact with which the expression is indirectly connected by its 
use; but so long as the statement is about the expression it must 
contain a name of the expression. 
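
Readers who program may find a loose analogy helpful here, offered as an illustration and not as part of the text's argument. In the Python sketch below, the object `boston` stands in for the city, the string `name` for the word ‘Boston’, and `quotation` for the quotation of that word. The class `City` and all variable names are hypothetical, and the population figure merely echoes the text's "some 800,000".

```python
import ast
from dataclasses import dataclass


@dataclass
class City:
    population: int


# Hypothetical stand-in for the city itself (not a word).
boston = City(population=800_000)

# The name of the city: a word of six letters and no quotation marks.
name = "Boston"

# A name of that name: the word enclosed in one pair of quotation marks.
quotation = repr(name)

print(len(name))                                 # 6
print(quotation)                                 # 'Boston'
print(sum(ch.isalpha() for ch in quotation))     # 6
print(quotation.count("'"))                      # 2
print(ast.literal_eval(quotation) == name)       # True

# Properties of the city belong to the object, not to its name.
print(boston.population)                         # 800000
print(hasattr(name, "population"))               # False
```

Evaluating the quotation yields the word, much as the quotation designates its interior; asking the word for a population fails in the way (2) fails.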

Lack of care in thus distinguishing the name from the named is 
common in mathematical writings. The following passage, from 
a widely used textbook on the differential calculus, is fairly typical: 

The expression Dₓy Δx is called the differential of the
function and is denoted by dy:

dy = Dₓy Δx.

The third line of this passage, an equation, is apparently supposed 
to reproduce the sense of the first two lines. But actually, whereas 
the equation says that the entities dy and Dₓy Δx (whatever these
may be) are the same, the preceding two lines say rather that the 
one is a name of the other. And the first line of the passage in¬ 
volves further difficulties; taken literally it implies that the 
exhibited expression ‘Dₓy Δx’ constitutes a name of some other,
unexhibited expression which is known as a differential. But all 
these difficulties can be removed by a slight rephrasing of the 
passage: drop the first two words and change ‘and is denoted by’ 
to ‘or briefly’. 

Expository confusions of this sort have persisted because, in 
most directions of mathematical inquiry, they have not made 
themselves felt as a practical obstacle. They do give rise to minor 
perplexities, indeed, even at the level of elementary arithmetic. 
A student of arithmetic may wonder, e.g., how 6 can be the 
denominator of 4/6 and not of 2/3 when 4/6 is 2/3; this puzzle arises from 
failure to observe that it is the fractions ‘4/6’ and ‘2/3’ that have 
denominators, whereas it is the designated ratios 4/6 and 2/3 that are 
identical. But it is primarily in mathematical logic that careless¬ 
ness over these distinctions is found to have its more serious 
effects. At the level of the logic of statements, one effect is obliter¬ 
ation of the distinction between predicates of statements and 
composition of statements — a distinction which will be con¬ 
sidered in the next section. 

Scrupulous use of quotation marks is the main practical measure 
against confusing objects with their names. But it has already 
been suggested that this particular method of naming expressions 
is not theoretically essential. E.g., using elaborately descriptive 
names of ‘Boston’, we might paraphrase (3) in either of the follow¬ 
ing ways: 

The word composed successively of the second, fifteenth, 
nineteenth, twentieth, fifteenth, and fourteenth letters of the 
alphabet is disyllabic. 

The 4354th word of Chants Democratic is disyllabic. 

Quotation is the more graphic and convenient method, but it has 
a certain anomalous feature which calls for special caution: from 
the standpoint of logical analysis each whole quotation must be 
regarded as a single word or sign, whose parts count for no more 
than serifs or syllables. A quotation is not a description, but a 
hieroglyph; it designates its object not by describing it in terms of 
other objects, but by picturing it. The meaning of the whole does 
not depend upon the meanings of the constituent words. The 
personal name buried within the first word of the statement: 

(11) ‘Cicero’ has six letters, 

e.g., is logically no more germane to the statement than is the verb 
‘let’ which is buried within the last word. Otherwise, indeed, the 
identity of Tully with Cicero would allow us to interchange these 
personal names, in the context of quotation marks as in any other 
context; we could thus argue from the truth (11) to the falsehood: 

‘Tully’ has six letters. 
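
The contrast between using a name and mentioning it can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the table `designates` and the two helper functions are hypothetical, and a Python string is made to play the part of a quotation, i.e. a name of the word it depicts.

```python
# Toy model (an assumption for illustration): what each personal name designates.
designates = {"Cicero": "the orator", "Tully": "the orator"}

def is_same_person(name_a, name_b):
    """The names are used: only what they designate matters."""
    return designates[name_a] == designates[name_b]

def has_six_letters(quoted_word):
    """The word is mentioned: the string stands for the word itself."""
    return len(quoted_word) == 6

# Identity of Tully with Cicero, stated by using both names.
same = is_same_person("Cicero", "Tully")

# (11) is true, yet exchanging the co-designating names inside the
# quotation yields a falsehood.
cicero_ok = has_six_letters("Cicero")
tully_ok = has_six_letters("Tully")
print(same, cicero_ok, tully_ok)
```

The function `has_six_letters` never consults `designates`, which mirrors the remark that the parts of a quotation count for no more than serifs or syllables.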


Frege seems to have been the first logician to recognize the importance of scru¬ 
pulous use of quotation marks for avoidance of confusion between use and mention 
of expressions (cf. Grundgesetze, vol. 1, p. 4); but unfortunately his counsel and good 
example in this regard went unheeded by other logicians for some thirty years. 
For further discussion of this topic see Carnap, Syntax, pp. 153-160. Concerning 
the necessity of treating a whole quotation as a single sign, see also § 1 of Tarski’s 
“Wahrheitsbegriff”. 
§5. Statements about Statements 

TO SAY that a city or a word has a given property, e.g. populous¬ 
ness or disyllabism, we attach the appropriate predicate to a name 
of the city or word in question (cf. § 4). To say that a statement 
has a given property, e.g. the phonetic property of being a hexam¬ 
eter or the semantic property of truth or falsehood, we attach the 
appropriate predicate to a name of the statement in question — 
not to the statement itself. Thus, to attribute truth to: 

(1) Jones is ill 
we write: 

(2) ‘Jones is ill’ is true, 
and to attribute falsehood we write: 

(3) ‘Jones is ill’ is false. 

Equivalently, we may write: 

(4) (1) is true, 

(5) (1) is false; 

but never: 

(6) Jones is ill is true, 

(7) Jones is ill is false, 

on the analogy of: 

(8) ~ Jones is ill. 

(2)-(5) are about the statement (1), but (8) is not; it, like (1), is 
about Jones. ‘Is true’ and ‘is false’ attach to names of statements 
precisely because, unlike ‘~’, they are predicates by means of 
which we speak about statements. Whereas statement connectives 
(‘~’, ‘.’, ‘v’, ‘⊃’, ‘≡’) attach to statements to form statements, a 
predicate is an expression which attaches to names to form statements. 
Grammar alone is enough to condemn (6) and (7), since 
each occurrence of ‘is’ should have a noun as subject. Confusion 
over this matter results in the view that the suffix ‘is true’ is 
vacuous, and that the suffix ‘is false’ is the English translation of 
the prefix ‘~’; the view, in other words, that (6) is equivalent to 
(1) and (7) to (8). 

In order to say that two objects stand in a given relation, e.g. 
hate, or remoteness, one puts an appropriate binary predicate 
(transitive verb) between names of the objects thus: ‘Roosevelt 
hates Hitler’, ‘Berlin is far from Washington’. To say that two 
statements stand in a given relation, whether the phonetic relation 
of rhyming or the semantic relation of implication, we put the 
appropriate binary predicate between names of the statements, 
not between the statements themselves. We may write: 

(9) ‘All men are mortal’ implies ‘all white men are mortal’, 

(10) The third statement of the book implies the seventh, 

but never: 

(11) All men are mortal implies all white men are mortal 
on the analogy of: 

(12) If all men are mortal then all white men are mortal, 

(13) All men are mortal ⊃ all white men are mortal. 

The verb ‘implies’ belongs between names of statements precisely 
because, unlike ‘⊃’ or ‘if-then’, it expresses a relation between 
statements; it is a binary predicate by means of which we talk 
about statements. (9) and (10) are about statements, while (12) 
and (13) are about men. 

The relation of implication in one fairly natural sense of the term, 
viz. logical implication, is readily described with help of the auxiliary 
notion of logical truth. A statement is logically true if it is not 
only true but remains true when all but its logical skeleton is varied 
at will; in other words, if it is true and contains only logical ex¬ 
pressions essentially, any others vacuously (cf. Introduction). 
Now one statement may be said logically to imply another when 
the truth-functional conditional which has the one statement as 
antecedent and the other as consequent is logically true. Thus 
(9), so construed, is equivalent to: 

(13) is logically true. 
A trivial analogue, material implication, may be said to hold 
whenever the truth-functional conditional which has the one state¬ 
ment as antecedent and the other as consequent is true. Thus one 
statement materially implies another provided merely that the 
first is false or the second true. This relation is so broad as not to 
deserve the name of implication at all except by analogy. But — 
and this is the point usually missed — ‘materially implies’ is still 
a binary predicate, not a binary statement connective. It stands 
to ‘⊃’ precisely as ‘is false’ stands to ‘~’. Insertion of the connective 
‘⊃’ between statements as in (13) amounts to inserting the 
verb ‘materially implies’, not between the statements themselves 
as in (11), but between their names as in (9). 
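
The difference between a connective and a predicate can be made concrete with a small Python sketch. It is a rough model only, and every name in it is an assumption: statements are represented as nested tuples over atomic letters, the connective is whatever builds a new tuple, and the predicates are functions that return a truth value about the statements handed to them. The function `tautologically_implies` covers only the truth-functional part of logical implication; the text's example (9) turns on quantificational structure that this sketch does not model.

```python
from itertools import product

# Statements as nested tuples (an assumption for illustration):
#   "p", ("not", s), ("and", s, t), ("or", s, t), ("cond", s, t), ("bicond", s, t)

def letters(s):
    """Collect the atomic letters occurring in a statement."""
    if isinstance(s, str):
        return {s}
    return set().union(*(letters(part) for part in s[1:]))

def truth_value(s, valuation):
    """Evaluate a statement under an assignment of truth values to letters."""
    if isinstance(s, str):
        return valuation[s]
    op = s[0]
    if op == "not":
        return not truth_value(s[1], valuation)
    a, b = truth_value(s[1], valuation), truth_value(s[2], valuation)
    if op == "and":
        return a and b
    if op == "or":
        return a or b
    if op == "cond":
        return (not a) or b
    if op == "bicond":
        return a == b
    raise ValueError(f"unknown connective: {op!r}")

def materially_implies(s, t, valuation):
    """Predicate: holds when the conditional of s and t is true."""
    return truth_value(("cond", s, t), valuation)

def tautologically_implies(s, t):
    """Predicate: holds when the conditional is true under every valuation."""
    names = sorted(letters(s) | letters(t))
    return all(
        truth_value(("cond", s, t), dict(zip(names, row)))
        for row in product([True, False], repeat=len(names))
    )

# The connective yields a statement, so it can be applied again.
nested = ("cond", "p", ("cond", "q", "p"))
print(materially_implies("p", "q", {"p": False, "q": False}))
print(tautologically_implies("p", "q"))
print(tautologically_implies(("and", "p", "q"), "p"))
```

Note that `("cond", s, t)` is again a statement, whereas the two predicates return `True` or `False`, which is not something the connective could be applied to in turn.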

With a few trivial exceptions such as material implication, any 
relation between statements will depend on something more than 
the truth values of the statements related. Such is the case, e.g., 
with the phonetic relation of rhyming. The same holds for the 
semantic relation of logical implication described above, and for 
any other relation which has (unlike material implication) a serious 
claim to the name of implication. Such relations are quite con¬ 
sonant with a policy of shunning non-truth-functional modes of 
statement composition (cf. § 1), since a relation of statements is 
not a mode of statement composition. On this account, the policy 
of admitting none but truth-functional modes of statement com¬ 
position is not so restrictive as might have at first appeared; what 
could be accomplished by a subjunctive conditional or other non¬ 
truth-functional mode of statement composition can commonly 
be accomplished just as well by talking about the statements in 
question, thus using an implication relation or some other strong 
relation of statements instead of the strong mode of statement 
composition. Instead of saying: 

If Perth were 400 miles from Omaha then Perth would be in America 

one might say: 

‘Perth is 400 miles from Omaha’ implies ‘Perth is in America’, 
in some appropriate sense of implication. 
Much of what has been said regarding implication applies 
equally to other semantic relations of statements, e.g. equivalence 
and compatibility. Statements are logically equivalent when they 
logically imply each other, and logically compatible when one does 
not imply the other’s denial. Or, what comes to the same thing, 
statements are logically equivalent when the biconditional formed 
from them is logically true, and they are logically compatible ex¬ 
cept when the conjunction formed from them is logically false, i.e., 
except when the denial of the conjunction is logically true. Trivial 
analogues, material equivalence and compatibility, are similarly 
determined: statements are materially equivalent when they 
materially imply each other, and materially compatible when one 
does not materially imply the other’s denial. Or, what comes to 
the same thing, statements are materially equivalent whenever 
their biconditional is true, and materially compatible except when 
their conjunction is false, hence whenever their conjunction is true. 
Material equivalence is agreement in truth value, and material 
compatibility is joint truth. Equivalence and compatibility, even 
in this degenerate sense, must be distinguished from the biconditional 
and conjunction; insertion of ‘≡’ or ‘.’ between statements 
amounts to inserting ‘is materially equivalent to’ or ‘is materially 
compatible with’, not between the statements themselves, but 
between their names. 

Note that ‘is true’, ‘is false’, ‘implies’, ‘is equivalent to’, etc. 
do not admit of iterated application as do the statement connectives. 
The expressions which ‘~’, ‘⊃’, etc. govern and the 
expressions which they produce are homogeneously statements; 
the expressions produced can hence be so governed in turn, and 
thus we obtain statements of the forms: 

~~—, — ⊃ (— ⊃ —), (~— . —) ⊃ ~—, 

etc. But ‘is false’, ‘implies’, etc. govern names and produce state¬ 
ments; hence the expressions produced cannot be so governed in 
turn. Formally the predicates ‘is false’ and ‘implies’ resemble the 
predicates ‘is negative’ and ‘≤’ of arithmetic rather than the 
statement connectives ‘~’ and ‘⊃’. Just as it is true for all 
numbers x, y, z that 

(x ≤ y . y ≤ z) ⊃ x ≤ z 
and (y is negative . x ≤ y) ⊃ x is negative, 

so it is true for all statements φ, ψ, χ that 

(φ implies ψ . ψ implies χ) ⊃ φ implies χ 
and (ψ is false . φ implies ψ) ⊃ φ is false; 

on the other hand the contexts: 

(φ implies ψ . ψ implies χ) implies (φ implies χ) 
and: (ψ is false . φ implies ψ) implies (φ is false) 

make no more sense than: 

(x ≤ y . y ≤ z) ≤ (x ≤ z), 

(y is negative . x ≤ y) ≤ (x is negative). 

In Whitehead and Russell’s exposition and terminology the distinction between 
predicate and statement connective is blurred. The notation ‘— ⊃ —’ is explained 
indiscriminately in the sense of the truth-functional conditional and in the 
sense of material implication. It is translated not only thus: 

(14) If — then — 
but also thus: 

(15) If—is true then — is true, 

(16) — is false or — is true, 

(17) —implies—. 

Similarly ‘≡’ is explained both in the sense of the truth-functional biconditional 
and in the sense of material equivalence, and ‘~’ is explained both in the sense of 
denial and in the sense of falsehood. The authors even adopt ‘implication’ or 
‘material implication’ as their regular terminology in connection with ‘⊃’, and 
‘equivalence’ in connection with ‘≡’. Actually, as we have seen, the blanks in 
(14) admit only statements whereas those in (15)-(17) admit only names of statements. 
In the construction of examples, indeed, grammatical sense leads Whitehead 
and Russell to fill the blanks of ‘— is true’, ‘— is false’, and ‘— implies —’ 
with quotations rather than statements; 1 but the distinction is straightway 
obliterated in the discussion. 

Once having noted the discrepancy between (14) and the other proposed translations 
of ‘— ⊃ —’, one need not delay in making his choice. In all technical 
developments the expressions which Whitehead and Russell adjoin to the sign 
‘⊃’ have the form of statements rather than names. The mere fact of its iteration 
indeed, e.g. in the manner: 

— ⊃ (— ⊃ —), 

is enough to determine the sign as a statement connective rather than a predicate 
about statements. In short, the versions (15)-(17) do not operate in Whitehead 
and Russell’s work beyond the level of unfortunate exposition and nomenclature. 
The English idiom which ‘— ⊃ —’ supplants in practice is not (15), (16), or (17), 
but (14). The case is similar with ‘≡’ and ‘~’. 

1 “‘x wrote Waverley’ is true” (vol. 1, p. 68); “‘the author . . . was a poet’ is 
false” (ibid.); “‘Socrates is a man’ implies ‘Socrates is mortal’” (ibid., pp. 20, 
138). 

On the topic of implication Whitehead and Russell have many critics, who 
rightly object that the trivial relation of material implication expressed in (16) is 
too weak to constitute a satisfactory version of (17). But it is seldom observed that 
this objection does not condemn the truth-functional conditional ‘— ⊃ —’ as a 
version of ‘if — then —’. Lewis, Smith, and others have undertaken systematic 
revision of ‘⊃’ with a view to preserving just the properties appropriate to a satisfactory 
relation of implication; but what the resulting systems describe are actually 
modes of statement composition — revised conditionals of a non-truth-functional 
sort — rather than implication relations between statements. 

If we were willing to reconstrue statements as names of some sort of entities, we 
might take implication as a relation between those entities rather than between the 
statements themselves; and correspondingly for equivalence, compatibility, etc. 
This procedure would dissolve the distinction between material implication and the 
truth-functional conditional, and likewise between other sorts of implication and 
other sorts of conditionals. ‘Implies’ would come to enjoy simultaneously the 
status of a binary predicate and the status of a binary statement connective. Ex¬ 
pressions such as (11) would be legitimized; and so also would the iterated use of 
implication, characteristic of Lewis and Smith. For thus construing statements as 
names some slight support can be adduced, indeed, by appeal to substantive clauses. 
The statement ‘All men are mortal’ might be held to designate that abstract entity, 
whatever it is, which we ordinarily designate by the substantive ‘that all men are 
mortal’. A deterring consideration, however, is the obscurity of these alleged 
entities. What are they like? and under what circumstances may the entities 
designated by two statements be said to be the same or different entities? Certain 
entities which are perhaps less obscure than these but no less abstract will indeed be 
countenanced at a later point (§ 22), viz. classes or properties, if only through ig¬ 
norance of how to get on without them; but entities designated by statements are 
happily dispensable. 1 It thus seems well to adhere to the common-sense view that 
statements are not names at all, though they may contain names along with verbs 
and adverbs and the rest. A statement remains meaningful, but meaningful by 
virtue of its structure together with the meanings of the constituent names and 
other words; its meaningfulness does not consist in its being a name of something. 

Conceding that ‘implies’ belongs between names of statements as in (9), rather 
than between statements, one might still urge that such a relation of implication 
produces a derivative mode of composition of the statements themselves — namely, 

1 See my “Ontological Remarks,” “Logistical Approach,” “Designation.” 
a mode which consists notationally of compounding the statements by means of 
‘implies’ and the two pairs of quotation marks. 1 If implication is construed as 
going beyond questions of truth value, this derivative mode of statement composi¬ 
tion will not be truth-functional. Implication thus construed would then seem, after 
all, to interfere with a policy of admitting none but truth-functional modes of state¬ 
ment composition. By the same argument, indeed, a purely morphological or 
phonetic relation such as containing or rhyming would interfere similarly. Actu¬ 
ally, however, derivation of modes of statement composition from relations in the 
suggested fashion involves abuse of quotation. The statements buried in the 
quotations in (9) cannot be treated in turn as constituents of (9), for a quotation 
figures as a single irreducible word. Similar abuse of quotation was seen in § 4 to 
lead from ‘ ‘Cicero’ has six letters’ to ‘ ‘Tully’ has six letters’. 2 

These latter remarks serve only to show that we can construe implication as 
going beyond questions of truth value without thereby committing ourselves to any 
form of conditional which goes beyond questions of truth value. The need for some 
such strong form of conditional might still be urged on other grounds. Certainly 
not all uses of the subjunctive conditional submit to the easy method of paraphrase 
illustrated in the case of Perth and America. When this fails we may look to other 
devices, e.g. Carnap’s method of reduction sentences (“Testability,” pp. 439-453); 
but if any really useful cases prove to resist all such methods of analysis, then we 
shall perhaps have to choose pragmatically between the usefulness of those cases 
and the convenience and clarity of the truth functional kind of statement composi¬ 
tion. Mathematics itself gives rise to no such recalcitrant cases; and any which 
seem to arise beyond the bounds of mathematics should be critically regarded. 3 


§6. Quasi-Quotation 

IN DISCUSSING the modes of statement composition we are 
having continually to talk of expressions. Quotation suffices for 
the mention of any specific expression, such as ‘v’ or ‘ = ’ or ‘Jones 
is away’, but is not available when we want to speak generally of 
an unspecified expression of such and such kind. On such occasions 
use has been made of general locutions such as ‘a conditional’, 
‘the first component’, etc.; and more difficult cases have been 
managed indirectly by introducing a blank ‘—’ from time to 

1 Analogous reasoning appears in Huntington’s “Note on a Recent Set", p. 11. 

2 See also Tarski, “ Wahrheitsbegriff,” § 1. 

3 For further discussion and references see Carnap, Syntax , §§ 67-71. 
time. But the developments to follow call for a more elastic 
method of referring to unspecified expressions. 

For the beginnings of such a method, the use of letters in algebra 
provides us with an adequate model. In algebra ‘x’, ‘y’, etc. are 
used as names of unspecified numbers; we may suppose them replaced 
by names of any specific numbers we choose. Analogously, 
Greek letters other than ‘ε’, ‘ι’, ‘λ’ 1 will now be used as names of 
unspecified expressions; we may suppose them replaced by names 
(e.g. quotations) of any specific expressions we choose. 

A discussion of numbers may, for example, begin thus: 

(1) Let x be a factor of y. 

Throughout the discussion thus prefaced, we are to think of x and y 
as any specific numbers we like which satisfy the condition (1) — 
say the numbers 5 and 15, or 4 and 32. We are to think of the 
letters ‘x’ and ‘y’ as if they were names of the numbers 5 and 15, or 
names of the numbers 4 and 32, etc. We are to imagine the letters 
‘x’ and ‘y’ replaced by the numerals (expressions) ‘5’ and ‘15’, or 
by ‘4’ and ‘32’, etc. 

Similarly a discussion of expressions might begin thus: 

(2) Let μ be part of ν. 

Throughout the discussion thus prefaced, we are to think of μ and 
ν as any specific expressions we like which satisfy the condition 
(2), say the expressions ‘York’ and ‘New York’, or ‘3’ and ‘32’. 
We are to think of the letters ‘μ’ and ‘ν’ as if they were names of 
the expressions ‘York’ and ‘New York’, or names of the expressions 
‘3’ and ‘32’, etc. We are to imagine the letters ‘μ’ and ‘ν’ replaced 
by the quotations ‘ ‘York’ ’ and ‘ ‘New York’ ’, or by ‘ ‘3’ ’ 
and ‘ ‘32’ ’, etc. 

The reader is urged to compare the above short paragraph with 
the preceding one, word by word; also to review § 4. Roughly 
speaking, the letters ‘x’, ‘y’, etc. may be described as ambiguous 
numerals, ambiguous names of numbers, variables ambiguously 
designating numbers, or, in the usual technical phrase, variables 
taking numbers as their values. Correspondingly the letters 


1 These three letters are reserved for later purposes; cf. §§ 22, 35, 41. 





‘μ’, ‘ν’, etc. may be roughly described as ambiguous quotations, 
ambiguous names of expressions, variables ambiguously designating 
expressions, or variables taking expressions as their values. 
This does not mean simply that ‘μ’ and ‘ν’ take the place of expressions, 
or are replaceable by expressions, for this is true of ‘x’ 
and ‘y’ as well. Rather, the letters ‘μ’ and ‘ν’ take the place of 
quotations or other names of expressions, just as ‘x’ and ‘y’ take 
the place of numerals or other names of numbers. 

Occasionally Greek letters will be used with accents or subscripts 
attached: ‘μ′’, ‘μ″’, ‘μ₁’, ‘μₙ’, etc. Such variants may be 
regarded simply as so many further Greek letters. Three Greek 
letters, ‘φ’, ‘ψ’, and ‘χ’, together with their accented and subscripted 
variants, will be limited in their use to those cases where 
the expression designated is intended to be a statement. They 
serve as names of unspecified statements, and are replaceable by 
statement quotations or other names of specific statements. 1 

There is need also of a convenient way of speaking of specific contexts 
of unspecified expressions: speaking, e.g., of the result of 
enclosing the unspecified expression μ in parentheses, or the result 
of joining the unspecified statements φ and ψ in that order by the 
sign ‘≡’. Note that quotation is not available here. The quotations: 

‘(μ)’, ‘φ ≡ ψ’ 

designate only the specific expressions therein depicted, containing 
the specific Greek letters ‘μ’, ‘φ’, and ‘ψ’. Reference to the intended 
contexts of the unspecified expressions μ, φ, and ψ will be 
accomplished by a new notation of corners, thus: 

(3) ⌜(μ)⌝, ⌜φ ≡ ψ⌝. 

Because of the close relationship which it bears to quotation, 
this device may be called quasi-quotation. It amounts to quoting 
the constant contextual backgrounds, ‘( )’ and ‘ ≡ ’, and imagining 
the unspecified expressions μ, φ, and ψ written in the 
blanks. If in particular we take the expression ‘Jones’ as μ, 
‘Jones is away’ as φ, and ‘Smith is ill’ as ψ, then ⌜(μ)⌝ is ‘(Jones)’ 
and ⌜φ ≡ ψ⌝ is ‘Jones is away ≡ Smith is ill’. 

1 The three letters have indeed already appeared in this use in § 5, where the 
sense intended was apparent. 

The quasi-quotations (3) are synonymous with the following 
verbal descriptions: 

The result of writing ‘(’ and then μ and then ‘)’, 

The result of writing φ and then ‘≡’ and then ψ; 

or, equivalently: 

The result of putting μ in the blank of ‘( )’, 

The result of putting φ and ψ in the respective blanks of ‘ ≡ ’; 

or, equivalently: 

The result of putting μ for ‘μ’ in ‘(μ)’, 

The result of putting φ for ‘φ’ and ψ for ‘ψ’ in ‘φ ≡ ψ’. 

We may translate any quasi-quotation: 

⌜——⌝ 

into words in corresponding fashion: 

The result of putting μ for ‘μ’, ν for ‘ν’, . . . , φ for ‘φ’, ψ for 
‘ψ’, . . . in ‘——’. 

Described in another way: a quasi-quotation designates that (unspecified) 
expression which is obtained from the contents of the 
corners by replacing the Greek letters (other than ‘ε’, ‘ι’, ‘λ’) by the 
(unspecified) expressions which they designate. 

When a Greek letter stands alone in corners, quasi-quotation is 
vacuous: ⌜μ⌝ is μ. For, by the foregoing general description, ⌜μ⌝ is 
the result of putting μ for ‘μ’ in ‘μ’; ⌜μ⌝ is what the letter ‘μ’ 
becomes when that letter itself is replaced by the (unspecified) 
expression μ; in other words, ⌜μ⌝ is simply that expression μ. 
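
Readers used to string templates may find the following Python sketch a helpful analogy for the description just given. It is an illustration under assumptions, not a definition: the function `quasi_quote` and the dictionary passed to it are hypothetical, Python strings stand in for expressions, and accented or subscripted Greek letters are not handled.

```python
# Letters reserved for other purposes are left untouched.
RESERVED = {"ε", "ι", "λ"}

def quasi_quote(contents, designated):
    """Return the expression obtained from the contents of the corners by
    replacing each unreserved lower-case Greek letter with the expression
    it designates; every other character is copied as it stands."""
    result = []
    for ch in contents:
        if "α" <= ch <= "ω" and ch not in RESERVED:
            result.append(designated[ch])
        else:
            result.append(ch)
    return "".join(result)

choice = {"μ": "Jones", "φ": "Jones is away", "ψ": "Smith is ill"}

print(quasi_quote("(μ)", choice))
print(quasi_quote("φ ≡ ψ", choice))
print(quasi_quote("μ", choice))   # a lone Greek letter: the vacuous case
```

Ordinary quotation corresponds to returning `contents` unchanged, which is why it can depict only the Greek letters themselves and never the expressions they stand for.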

Quasi-quotation would have been convenient at earlier points, 
but was withheld for fear of obscuring fundamentals with excess 
machinery. Now, however, it may be a useful exercise to re¬ 
capitulate some sample points from §§ 1-5 in terms of this device. 
A conjunction ⌜φ . ψ⌝ is true just in case φ and ψ are both true, and 
an alternation ⌜φ v ψ⌝ is false just in case φ and ψ are both false. 
A conditional ⌜φ ⊃ ψ⌝ is true if φ is false or ψ true, and false if φ is 
true and ψ false. A biconditional ⌜φ ≡ ψ⌝ is true just in case φ and 
ψ are alike in truth value. A denial ⌜~φ⌝ is true just in case φ is 
false. φ logically implies ψ or is logically equivalent to ψ according 
as ⌜φ ⊃ ψ⌝ or ⌜φ ≡ ψ⌝ is logically true, and φ materially implies ψ or 
is materially equivalent to ψ according as ⌜φ ⊃ ψ⌝ or ⌜φ ≡ ψ⌝ is 
true. 


§ 7. Parentheses and Dots 

PARENTHESES, taken for granted thus far as an auxiliary nota¬ 
tion, are most simply construed as forming integral parts of the 
binary connectives. The notations of conjunction, alternation, the 
conditional, and the biconditional will no longer be regarded as 
⌜φ . ψ⌝, ⌜φ v ψ⌝, ⌜φ ⊃ ψ⌝, and ⌜φ ≡ ψ⌝, but as ⌜(φ . ψ)⌝, ⌜(φ v ψ)⌝, 
⌜(φ ⊃ ψ)⌝, and ⌜(φ ≡ ψ)⌝. But the notation of denial remains simply 
⌜~φ⌝. This formulation yields just the usage of parentheses 
which we have hitherto followed, except in one respect: a con¬ 
junction, alternation, conditional, or biconditional comes now to 
bear an outside pair of parentheses even when it stands apart from 
any further symbolic context. Such outside parentheses have 
hitherto been omitted. 

When the binary modes of composition are thus construed, no 
auxiliary technique of grouping is needed. The syntactical sim¬ 
plicity thus gained proves useful in certain abstract studies (e.g. 
§ 10; also later chapters). In applications, however, such sim¬ 
plicity is less important than facility of reading; and excess of 
parentheses is a hindrance, for we have to count them off in pairs 
to know which ones are mates. It is hence convenient in practice 
to omit the outside parentheses as hitherto, and furthermore to 
suppress most of the remaining parentheses in favor of a more 
graphic notation of dots. 

Parentheses mark the outer limits of a binary compound; dots 
determine those same limits less directly. Roughly speaking, a 
group of dots placed beside ‘v’, ‘⊃’, or ‘≡’ indicates that the component 
on that side has its other end at the nearest larger group of 
dots, if any, and otherwise at the limit of the whole symbolic context. 
E.g., since ‘≡’ in 

⌜φ v: ψ ⊃ χ .≡ φ :.⊃. ψ ⊃ φ⌝ 

has just one dot prefixed, the first component of the biconditional 
begins at the last previous group of two or more dots; hence, as it 
happens, at the group of two. Again, since ‘≡’ has no dots suffixed, 
the second component of the biconditional stops as soon as it meets 
any dots at all; hence at the group of three. The components of 
the biconditional are thus ⌜ψ ⊃ χ⌝ and φ. The alternation has 
⌜ψ ⊃ χ .≡ φ⌝ as second component, for this runs from the pair of 
dots to the first larger group. The conditional corresponding to 
the second occurrence of ‘⊃’ has ⌜φ v: ψ ⊃ χ .≡ φ⌝ in its entirety 
as antecedent, since the group of dots prefixed to the second occurrence 
of ‘⊃’ exceeds any previous group. Similarly that conditional 
has ⌜ψ ⊃ φ⌝ as consequent. The whole would appear in 
terms of parentheses as 

⌜(φ v ((ψ ⊃ χ) ≡ φ)) ⊃ (ψ ⊃ φ)⌝. 

Dots serve in this fashion to reinforce the connectives ‘v’, ‘⊃’, 
and ‘≡’. In the case of conjunction, already represented by a dot, 
such reinforcement will be accomplished simply by using a group 
of dots instead of the one; thus the conjunction of φ with 
⌜ψ v χ .⊃ φ⌝ appears as ⌜φ : ψ v χ .⊃ φ⌝. Use of dots for conjunction 
is distinguishable from the other uses by the absence of any adjacent 
connective ‘v’, ‘⊃’, or ‘≡’. A group of n dots standing as a 
conjunction sign will be regarded as indicating a smaller break than 
that indicated by a group of n dots alongside ‘v’, ‘⊃’, or ‘≡’, 
though of course a greater break than that indicated by n − 1 dots. 
Thus, in 

⌜φ . ψ .⊃ χ : ψ v φ :≡ χ⌝ 

the conjunction indicated by the single dot has φ and ψ as components; 
the conjunction indicated by the pair of dots has 
⌜φ . ψ .⊃ χ⌝ and ⌜ψ v φ⌝ as components; the conditional has ⌜φ . ψ⌝ 
and χ as components; and the biconditional has ⌜φ . ψ .⊃ χ : ψ v φ⌝ 
and χ. The whole would appear in terms of parentheses as: 

⌜(((φ . ψ) ⊃ χ) . (ψ v φ)) ≡ χ⌝. 

A place occupied by dots will be called a joint; also, if ‘v’, ‘⊃’, or 
‘≡’ lacks dots on one side, the narrow blank place on that side will 
be called a joint. One joint will be said to be looser than another if 
it contains more dots than the other, or if it contains just as many 
dots as the other and is adjacent to ‘v’, ‘⊃’, or ‘≡’ while the other 
is not. Now the way to read dots can be summed up thus, subject 
to revisions in the immediate sequel: (I) The second component 
of a conjunction, alternation, conditional, or biconditional ends at 
the first subsequent joint, if any, which is looser than the one at its 
beginning; otherwise it runs to the end of the whole symbolic context. 
The first component begins at the last previous joint, if any, which is 
looser than the one at its end; otherwise it begins at the beginning of 
the whole context. 

Allowance must be made now for the survival of parentheses in 
the midst of the dot notation, for it is convenient to be able to mix 
the two notations at will. After a denial sign, indeed, a compound 
will always retain its outer parentheses, even in dot-strewn contexts. 
E.g., we write ⌜~(φ ⊃. ψ v χ)⌝; never ⌜~: φ ⊃. ψ v χ⌝. 
Dots may or may not be used to reinforce a binary connective, but 
they will never be used to reinforce ‘~’. Now the presence of 
parentheses obliges us to qualify (I) so that joints sealed up in 
parentheses will not be made to affect the grouping of outside 
text. E.g., the pair of dots in 

⌜φ ≡. ψ v ~(φ ⊃. ψ . χ :⊃ χ)⌝ 

must not be made to terminate the second component of the bicon¬ 
ditional; that component is supposed to run to the end of the 
whole expression. Similarly for the second component of the 
alternation. (I) can be suitably amended by adding the obvious 
requirement that a component must contain no dangling paren¬ 
thesis; it must contain equal numbers of left- and right-hand 
parentheses. 

The presence of parentheses also has another effect: if a 
component of a binary compound has its one end inside some 
parenthesized expression, it may be bounded at its other end by the 
limit of that parenthesized expression rather than by a joint. In the 
case of a second component, such an ending will be forced by any 
right-hand parenthesis whose mate did not intervene, hence by 
any right-hand parenthesis which may turn up, so long as the 
component already contains equal numbers of left- and right-hand 
parentheses. The case of a first component is analogous. 

In view of these considerations, (I) gives way to the following 
final formulation. (II) The components of a conjunction, alternation, 
conditional, or biconditional are as short as these conditions 
allow: each component contains just as many left- as right-hand 
parentheses; the second component ends either at the end of the whole 
symbolic context or else just before a right-hand parenthesis or a joint 
looser than the one at its beginning; and the first component begins 
either at the beginning of the whole context or else just after a left-hand 
parenthesis or a joint looser than the one at its end. 

Note that (II) compels a denial sign to apply, as usual, to the 
shortest statement following it. It compels interpretation of 
⌜~φ ∨ ψ⌝ as the alternation of ⌜~φ⌝ and ψ, interpretation of 
⌜~(φ ⊃. ψ ∨ χ) ≡ χ⌝ as the biconditional of ⌜~(φ ⊃. ψ ∨ χ)⌝ and 
χ, and so on. For, according to (II), the first component of the 
alternation or biconditional must begin just after a joint or a 
left-hand parenthesis or at the beginning of the whole context; it 
cannot begin just after ‘~’. 

As appears from the examples, the use of dots proceeds without 
regard to the complexity of the unspecified statements denoted by 
the Greek letters. We write ⌜φ ⊃. ψ ∨ χ⌝, not ⌜φ .:⊃. ψ ∨ χ⌝, 
even though φ be thought of in particular as a statement complex 
enough to contain say a group of two dots in turn. This practice 
can be justified by thinking of the statements indicated by Greek 
letters as constructed always with parentheses rather than dots, 
to the extent at least of bearing their outermost parentheses. 

The parenthesis notation formulated at the beginning of the 
section is retained, for the sake of theoretical developments, as the 
“official” notation; dots enter only as shorthand. This unofficial 
use of the dot notation has its justification in the fact that the 
parenthesis notation can easily be restored whenever we like. To 
accomplish this in any given case we have only to put parentheses 
around each expression μ which satisfies the following conditions 
(i) and (ii), and then drop all dots except single dots for 
conjunction. 

(i) μ is a component of a conjunction, alternation, conditional, 
or biconditional, according to (II), or else it exhausts the whole 
symbolic context; 

(ii) μ contains a binary connective which is unparenthesized in 
μ; i.e., one which is preceded in μ by no more left- than right-hand 
parentheses. 

Note now, in preparation for one further notational departure, 
that the grouping in an iterated conjunction is always immaterial; 
⌜φ . ψ : χ⌝ and ⌜φ : ψ . χ⌝, for example, are equivalent. Any iterated 
conjunction of φ₁, φ₂, . . . , and φₙ, regardless of the grouping, is 
true in just one case: the case where φ₁, φ₂, . . . , and φₙ are all 
true. For, a conjunction is true just in case both its immediate 
components are true; either component, if in turn a conjunction, 
is true just in case both its components are true; and so on. The 
grouping in an iterated alternation is likewise immaterial; every 
iterated alternation of φ₁, φ₂, . . . , and φₙ is true in the same cases: 
all cases except the one where φ₁, φ₂, . . . , and φₙ are all false. 
For, an alternation is false just in case both components are false; 
either component, if in turn an alternation, is false just in case both 
its components are false; and so on. Because of these circumstances 
it is sometimes convenient to use uniform connectives 
throughout the iteration, thus writing 

⌜φ₁ . φ₂ . φ₃ . φ₄⌝, ⌜φ₁ ∨ φ₂ ∨ φ₃ ∨ . . . ∨ φₙ⌝, 

also ⌜φ ⊃ φ′ .∨. ψ ⊃ ψ′ .∨. χ ⊃ χ′⌝, 

etc., without further indication of grouping. One or another 
arbitrary indication of grouping must be restored, however, before 
analyzing such a compound along the lines prescribed in the 
foregoing paragraphs. 
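
The claim that grouping is immaterial can be checked mechanically for small cases. The following is only a sketch in Python, not part of the original text; the names `groupings`, `evaluate`, and `grouping_is_immaterial` are illustrative assumptions. It enumerates every way of grouping n components and compares the resulting truth values; the conditional is included as a contrast, since its grouping does matter.

```python
from itertools import product

def groupings(items):
    """Yield every full grouping of items as nested pairs."""
    if len(items) == 1:
        yield items[0]
        return
    for split in range(1, len(items)):
        for left in groupings(items[:split]):
            for right in groupings(items[split:]):
                yield (left, right)

def evaluate(tree, combine):
    """Evaluate a nested-pair tree of truth values with one binary connective."""
    if isinstance(tree, bool):
        return tree
    left, right = tree
    return combine(evaluate(left, combine), evaluate(right, combine))

def grouping_is_immaterial(combine, n):
    """True if all groupings of n components agree in every one of the 2**n cases."""
    for values in product([True, False], repeat=n):
        results = {evaluate(tree, combine) for tree in groupings(list(values))}
        if len(results) != 1:
            return False
    return True

conjunction = lambda p, q: p and q
alternation = lambda p, q: p or q
conditional = lambda p, q: (not p) or q   # contrast case: grouping matters here

print(grouping_is_immaterial(conjunction, 4))
print(grouping_is_immaterial(alternation, 4))
print(grouping_is_immaterial(conditional, 3))
```

A finite check of this kind confirms the claim only for the values of n actually tried; the general argument is the one given in the text.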

The indication of grouping by means of dots dates from Peano. In details the 
usage varies from author to author. Curry’s usage (“Dots as Brackets”) is neater 
than that set forth above, but not as easy to read. Since parentheses are so much 
simpler in theory, I have preferred to retain them as the “official” notation and 
then to frame dot conventions with an eye only to perspicuity and approximate 
conformity to prevalent usage. The syntactical simplicity of the “official” notation 
could have been further enhanced, indeed, by writing binary compounds in the 
fashion ⌜∨φψ⌝, ⌜⊃φψ⌝, etc. instead of ⌜(φ ∨ ψ)⌝, ⌜(φ ⊃ ψ)⌝, etc.; under this 
rearrangement, as Łukasiewicz has remarked (“Untersuchungen,” p. 31, reprint 
p. 3), grouping is unambiguously discernible without help of parentheses or other 
punctuation. 
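
That the prefixed arrangement needs no punctuation can be illustrated by a small recursive reader. This is a sketch only, under assumptions not in the text: components are single letters, the connective signs are those used above, and the function names `parse_prefix` and `to_infix` are hypothetical.

```python
BINARY = {"∨", "⊃", ".", "≡"}   # binary connectives, written before their components
UNARY = {"~"}                   # denial

def parse_prefix(symbols, pos=0):
    """Read one statement beginning at symbols[pos]; return (infix string, next position)."""
    head = symbols[pos]
    if head in BINARY:
        left, pos = parse_prefix(symbols, pos + 1)
        right, pos = parse_prefix(symbols, pos)
        return f"({left} {head} {right})", pos
    if head in UNARY:
        body, pos = parse_prefix(symbols, pos + 1)
        return f"~{body}", pos
    return head, pos + 1        # a single-letter component

def to_infix(text):
    """Restore the parenthesis notation from a prefixed string."""
    symbols = [c for c in text if not c.isspace()]
    result, end = parse_prefix(symbols)
    if end != len(symbols):
        raise ValueError("trailing symbols after a complete statement")
    return result

print(to_infix("⊃∨pqr"))
print(to_infix("∨p⊃qr"))
```

Each connective announces in advance how many components follow it, so the reader never has to choose between groupings.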

§8. Reduction to Three Primitives 

THE TRUTH tables of singulary and binary modes of composition 
have the forms depicted respectively in Tables 1 and 2. Each way 
of putting ‘T’s and ‘F’s for the question marks gives us a different 
ternary mode of composition, or mode of combining statements 
three at a time, our table would have a column for each of the 
hood on the part of n components φ₁, φ₂, . . . , φₙ (cf. § 3), and the 
last column specifies the truth value of the compound for the 
successive cases. Each of the 2^(2^n) ways of replacing the question 
marks by ‘T’ or ‘F’ determines a different n-ary mode of 
composition. 

Of the 2^(2^1) (= 4) singulary modes thus determined, we have 
considered only one: denial. Of the 2^(2^2) (= 16) binary modes we have 
considered only four. The 2^(2^3) (= 256) ternary modes, the 2^(2^4) 
(= 65,536) quaternary ones, and so on, all remain untouched. 
Actually, however, the few modes already considered are all we 
ever need; for it will now be shown that every mode of composition 
describable by a truth table is translatable into terms of denial, 
conjunction, and alternation.¹ In the next section a still more striking 
reduction will be made. 
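
The counts just cited can be confirmed by brute enumeration. The sketch below is illustrative only; `count_modes` is a hypothetical name. It lists the 2^n cases for n components and then counts the ways of marking each case ‘T’ or ‘F’.

```python
from itertools import product

def count_modes(n):
    """Count the n-ary truth tables by enumerating them one by one."""
    cases = list(product([True, False], repeat=n))        # the 2**n rows
    tables = product([True, False], repeat=len(cases))    # one T/F per row
    return sum(1 for _ in tables)

for n in (1, 2, 3, 4):
    print(n, count_modes(n), 2 ** (2 ** n))
```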

Case 1 of Table 3 is the case where φ₁, φ₂, φ₃, . . . , and φₙ are all 
true; in other words, it is the case where 

⌜φ₁ . φ₂ . φ₃ . ... . φₙ⌝ 

is true (cf. § 7). Let us call this conjunction χ₁. Again, Case 2 is 
the case where φ₁ is false and φ₂, φ₃, . . . , and φₙ are true; in other 
words, it is the case where ⌜~φ₁⌝, φ₂, φ₃, . . . , and φₙ are all true; 
in other words, it is the case where 

⌜~φ₁ . φ₂ . φ₃ . ... . φₙ⌝ 

is true. Let us call this conjunction χ₂. Similarly Case 3 is the 
case where 

⌜φ₁ . ~φ₂ . φ₃ . ... . φₙ⌝ 

is true; let us call this χ₃. Case 4 is the case where 

⌜~φ₁ . ~φ₂ . φ₃ . ... . φₙ⌝ 

is true; let us call this χ₄. In this fashion Case i, for each i from 
1 to 2^n, is allotted a statement χᵢ which is true in just that case and 
is composed of φ₁, φ₂, φ₃, . . . , φₙ by means of nothing more than 
conjunction and denial. (In the trivial situation where n is 1, 
conjunction of course drops out; χ₁ is φ₁ and χ₂ is ⌜~φ₁⌝.) 

Now every truth table is formed from Table 3 (for appropriate 
n) by assigning ‘T’ and ‘F’ to the successive cases, in place of the 
question marks. But if it is formed by assigning ‘T’ to just one 
case, say Case i, and ‘F’ to the rest, then it is the table for χᵢ; for 
χᵢ is true in Case i and false in the others. If on the other hand it is 
formed by assigning ‘T’ to many cases, say Cases i, j, k, . . . , 
then it is the table for 

⌜χᵢ ∨ χⱼ ∨ χₖ ∨ . . .⌝; 

for this alternation is true wherever any of χᵢ, χⱼ, χₖ, . . . is true 
(cf. § 7), hence in Cases i, j, k, . . . . If finally the table is formed 
by assigning ‘F’ to all cases, then it is the table for any trivial 
conjunction such as 

⌜φ₁ . ~φ₁ . φ₂ . ... . φₙ⌝; 

for there is no case where φ₁ and ⌜~φ₁⌝ are both true, hence no 
case where this conjunction is true. Now since 

χᵢ, ⌜χᵢ ∨ χⱼ ∨ χₖ ∨ . . .⌝, ⌜φ₁ . ~φ₁ . φ₂ . ... . φₙ⌝ 

are all compounded of φ₁, φ₂, . . . , φₙ by means only of denial, 
conjunction, and alternation, we see that every truth table 
describes a mode of composition involving nothing beyond those 
three. All truth-functional modes of composition are thus 
translatable into terms of denial, conjunction, and alternation. We can 
take these as the sole basic or primitive truth-functional modes of 
composition, and construe all others merely as results of repeated 
application of the primitive modes. 

1 This was first shown by Post. 

Given the truth table of any mode of composition, the method of 
translating that mode into terms of the three primitives is apparent 
from the foregoing proof; we have only to form the conjunction 
χᵢ, or the alternation of the conjunctions χᵢ, χⱼ, χₖ, . . . , 
corresponding to the case or cases marked with ‘T’ in the table. If 
there is no ‘T’, we resort to the trivial device of forming a 
conjunction containing φ₁ and ⌜~φ₁⌝. As examples, consider the 
conditional and the biconditional. Here n is 2; χ₁, χ₂, χ₃, and χ₄ 
become respectively ⌜φ₁ . φ₂⌝, ⌜~φ₁ . φ₂⌝, ⌜φ₁ . ~φ₂⌝, and 
⌜~φ₁ . ~φ₂⌝, or better ⌜φ . ψ⌝, ⌜~φ . ψ⌝, ⌜φ . ~ψ⌝, and ⌜~φ . ~ψ⌝. 
Now the conditional has in its table three ‘T’s, marking Cases 
1, 2, and 4; hence ⌜φ ⊃ ψ⌝ is expressible as ⌜χ₁ ∨ χ₂ ∨ χ₄⌝, i.e., 

⌜φ . ψ .∨. ~φ . ψ .∨. ~φ . ~ψ⌝. 

Again, the biconditional has in its table two ‘T’s, marking Cases 
1 and 4; hence ⌜φ ≡ ψ⌝ is expressible as ⌜χ₁ ∨ χ₄⌝, i.e., 

⌜φ . ψ .∨. ~φ . ~ψ⌝. 

The method always gives us a translation into terms of denial, 
conjunction, and alternation, but it does not always happen to give 
the shortest. For example, ⌜φ ⊃ ψ⌝ admits also of the shorter 
translations ⌜~φ ∨ ψ⌝ and ⌜~(φ . ~ψ)⌝, as observed in § 2. 
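
The method of translation can be sketched as a short Python routine. This is an illustration rather than anything in the original text: the function names `cases`, `case_conjunction`, and `translate` are assumptions, and a truth table is represented simply by its last column, listed in the order of cases used above (all components true first, the first component alternating fastest).

```python
from itertools import product

def cases(n):
    """The 2**n cases in the order of the text: Case 2 makes only the first component false."""
    return [tuple(reversed(row)) for row in product([True, False], repeat=n)]

def case_conjunction(row, names):
    """The conjunction that is true in exactly the given case."""
    return " . ".join(name if value else "~" + name for name, value in zip(names, row))

def translate(column, names):
    """Express a mode, given by its last column, through denial, conjunction, alternation."""
    true_rows = [row for row, value in zip(cases(len(names)), column) if value]
    if not true_rows:   # no 'T' anywhere: fall back on a trivial contradiction
        return " . ".join([names[0], "~" + names[0]] + list(names[1:]))
    return " .∨. ".join(case_conjunction(row, names) for row in true_rows)

conditional = (True, True, False, True)      # 'T' in Cases 1, 2, and 4
biconditional = (True, False, False, True)   # 'T' in Cases 1 and 4
print(translate(conditional, ("φ", "ψ")))
print(translate(biconditional, ("φ", "ψ")))
```

As the text observes, the result is always a correct translation but not always the shortest one.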

It is trivial but perhaps instructive to apply the method 
also to the singulary modes of composition. Here χ₁ and χ₂ are 
simply φ₁ and ⌜~φ₁⌝; or, dropping the subscript, φ and ⌜~φ⌝. 
Now one of the four singulary modes assigns ‘T’ to Cases 1 and 2 
alike; it is thus expressible as ⌜χ₁ ∨ χ₂⌝, or ⌜φ ∨ ~φ⌝. Another 
assigns ‘T’ to Case 1 alone; hence it is expressible as χ₁, or simply 
φ. A third assigns ‘T’ to Case 2 alone; hence it is expressible as 
χ₂, or ⌜~φ⌝. The remaining singulary mode assigns ‘F’ to both 
cases; hence, by the device usual in the absence of ‘T’s, it is 
expressible as ⌜φ . ~φ⌝. 

§9. Reduction to One Primitive 

WE NOW turn to the binary mode of statement composition 
embodied in the connective ‘neither-nor’. This will be called joint 
denial, and rendered symbolically in the notation ⌜(φ ↓ ψ)⌝. A 
joint denial, e.g. ‘Neither is Jones away nor is Smith ill’, 
symbolically: 

(Jones is away ↓ Smith is ill), 

is true just in case its components are both false. This gives us the 
following table: 

φ   ψ   ⌜(φ ↓ ψ)⌝ 

T   T       F 
F   T       F 
T   F       F 
F   F       T 

Since ⌜(φ ↓ ψ)⌝ is true just in the case where ⌜~φ⌝ and ⌜~ψ⌝ are 
both true, it is translatable into terms of denial and conjunction 
as ⌜(~φ . ~ψ)⌝. Again, since it is true just in the case where 
⌜(φ ∨ ψ)⌝ is false, it is translatable into terms of denial and 
alternation as ⌜~(φ ∨ ψ)⌝. The fact that ⌜(φ ↓ ψ)⌝ denies ⌜(φ ∨ ψ)⌝ is 
reflected in ordinary language, indeed, by the cancellatory ‘n’ which 
turns ‘either-or’ into ‘neither-nor’. The vertical mark in ‘↓’ may 
be thought of as having the same cancellatory effect upon the sign 
‘∨’; it is like the vertical mark in the inequality sign ‘≠’ of 
arithmetic. 

With denial, conjunction, and alternation at hand, a special 
notation for joint denial is superfluous. But it is introduced for 
the sake of translation in the opposite direction; for it turns out 
that denial, conjunction, and alternation can all be expressed in 
terms of joint denial alone: ⌜~φ⌝ as ⌜(φ ↓ φ)⌝, ⌜(φ . ψ)⌝ as 
⌜((φ ↓ φ) ↓ (ψ ↓ ψ))⌝, and ⌜(φ ∨ ψ)⌝ as ⌜((φ ↓ ψ) ↓ (φ ↓ ψ))⌝. This is 
seen by constructing tables according to the method of § 3: 

⌜(φ ↓ φ)⌝ 
  T F T 
  F T F 

⌜((φ ↓ φ) ↓ (ψ ↓ ψ))⌝ 
   T F T  T  T F T 
   F T F  F  T F T 
   T F T  F  F T F 
   F T F  F  F T F 

⌜((φ ↓ ψ) ↓ (φ ↓ ψ))⌝ 
   T F T  T  T F T 
   F F T  T  F F T 
   T F F  T  T F F 
   F T F  F  F T F 

Comparison of the middle columns of these tables with the tables 
of denial, conjunction, and alternation shows that ⌜(φ ↓ φ)⌝ is true 
in just the case where ⌜~φ⌝ is true, that ⌜((φ ↓ φ) ↓ (ψ ↓ ψ))⌝ is 
true in just the case where ⌜(φ . ψ)⌝ is true, and that 
⌜((φ ↓ ψ) ↓ (φ ↓ ψ))⌝ is true in just the cases where ⌜(φ ∨ ψ)⌝ is true. 
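
The same comparison can be run mechanically. In the sketch below, which is an illustration and not part of the text, the hypothetical function `nor` plays the role of joint denial, and each of the three translations is checked against Python's own `not`, `and`, and `or` in all four cases.

```python
from itertools import product

def nor(p, q):
    """Joint denial: true just in case both components are false."""
    return not (p or q)

for p, q in product([True, False], repeat=2):
    assert nor(p, p) == (not p)                        # denial
    assert nor(nor(p, p), nor(q, q)) == (p and q)      # conjunction
    assert nor(nor(p, q), nor(p, q)) == (p or q)       # alternation
print("all three translations agree with their tables")
```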

Since all truth-functional modes of composition are translatable 
into terms of denial, conjunction, and alternation (cf. § 8), the 
translation in turn of these three into terms of joint denial shows 
that all the truth-functional modes of composition are translatable 
into terms of joint denial alone. Iterated application of this one 
mode of composition suffices for expressing anything that could be 
expressed by any of the others. We can take joint denial as the 
sole primitive truth-functional mode of composition, and construe 
all the others merely as results of applying this primitive mode in 
iteration. 

In theoretical developments (e.g. §§ 10, 18) it is convenient to 
be able to treat ‘↓’ or ‘neither-nor’ thus as the sole truth-functional 
connective. But in applications it is convenient to have the signs 
‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’ as well, for the brevity and clarity 
which they afford. Adoption of ‘↓’ as sole connective demands 
continual use of compounds so cumbersome, indeed, that conventions 
of shorthand reducing them to manageable length would 
become a practical necessity. 

Between the theoretical advantage of a single connective and the 
practical advantage of a multiplicity, this idea of shorthand effects 
a complete reconciliation. By way of basic or primitive notation 
we may adopt the form of notation ⌜(φ ↓ ψ)⌝, but we can still 
introduce the convenient forms of notation ⌜~φ⌝, ⌜(φ . ψ)⌝, ⌜(φ ∨ ψ)⌝, 
etc., simply as shorthand abbreviations for the appropriate 
complexes involving ‘↓’. Thus, where φ is any statement, ⌜~φ⌝ will 
be construed as an abbreviation of ⌜(φ ↓ φ)⌝. This convention of 
abbreviation will be referred to as D1. 

D1. ⌜~φ⌝ for ⌜(φ ↓ φ)⌝. 

Similarly, where φ and ψ are any statements, ⌜(φ . ψ)⌝ and ⌜(φ ∨ ψ)⌝ 
will be regarded as abbreviations of ⌜((φ ↓ φ) ↓ (ψ ↓ ψ))⌝ and 
⌜((φ ↓ ψ) ↓ (φ ↓ ψ))⌝. More briefly, we may explain ⌜(φ . ψ)⌝ and 
⌜(φ ∨ ψ)⌝ as abbreviations of ⌜(~φ ↓ ~ψ)⌝ and ⌜~(φ ↓ ψ)⌝; 
for, D1 explains ⌜(~φ ↓ ~ψ)⌝ and ⌜~(φ ↓ ψ)⌝ as abbreviations 
in turn of ⌜((φ ↓ φ) ↓ (ψ ↓ ψ))⌝ and ⌜((φ ↓ ψ) ↓ (φ ↓ ψ))⌝. Our next 
two conventions of abbreviation are hence as follows: 

D2. ⌜(φ . ψ)⌝ for ⌜(~φ ↓ ~ψ)⌝. 

D3. ⌜(φ ∨ ψ)⌝ for ⌜~(φ ↓ ψ)⌝. 

Such conventions of abbreviation are called formal definitions; 
hence the use of ‘D’ in numbering them. To define a sign formally 
is to adopt it as shorthand for some form of notation already at 
hand. If the sign has a preconceived meaning, as in the present 
instances, and the definition suits that meaning, then the definition 
amounts to an elimination: it shows that the sign is dispensable in 
favor of those occurring in the definition. To define a sign is to 
show how to avoid it. 

The relationships utilized for the definitions D1-3 were shown 
by truth tables to follow from the meanings hitherto assigned to 
‘~’, ‘.’, ‘∨’, and ‘↓’. But they are readily checked also on the 
linguistic level. ‘Jones is not away’ has a prolix but obviously 
accurate equivalent in ‘Neither is Jones away nor is Jones away’; 
so has ‘Jones is away and Smith is ill’ in ‘Neither is Jones not 
away nor is Smith not ill’; and ‘Jones is away or Smith is ill’ 
amounts obviously to the denial of ‘Neither is Jones away nor is 
Smith ill’. 

Definitions of ⌜(φ ⊃ ψ)⌝ and ⌜(φ ≡ ψ)⌝ are apparent from earlier 
observations (§§ 2, 3): 

D4. ⌜(φ ⊃ ψ)⌝ for ⌜(~φ ∨ ψ)⌝. 

D5. ⌜(φ ≡ ψ)⌝ for ⌜((φ ⊃ ψ) . (ψ ⊃ φ))⌝. 

⌜(φ ⊃ ψ)⌝ and ⌜(φ ≡ ψ)⌝ are defined ultimately in terms of joint 
denial, by virtue of these two definitions together with their 
predecessors. D4, for example, defines ⌜(φ ⊃ ψ)⌝ as ⌜~(~φ ↓ ψ)⌝ 
in view of D3, and hence as 

⌜(((φ ↓ φ) ↓ ψ) ↓ ((φ ↓ φ) ↓ ψ))⌝ 

in view of D1. 
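
The elimination of defined signs in favor of joint denial can be sketched as a small rewriting routine. What follows is an illustration under stated assumptions: statements are represented as nested tuples, the tags "not", "and", "or", "imp", "iff", and "nor" and the function names `expand` and `show` are hypothetical, and unanalyzed components are plain strings.

```python
def expand(s):
    """Rewrite a statement into joint denial alone by applying D1-D5 repeatedly."""
    if isinstance(s, str):                    # an unanalyzed component such as 'φ'
        return s
    op, *args = s
    if op == "nor":
        return ("nor", expand(args[0]), expand(args[1]))
    if op == "not":                           # D1: (φ ↓ φ)
        a = expand(args[0])
        return ("nor", a, a)
    if op == "and":                           # D2: (~φ ↓ ~ψ)
        return expand(("nor", ("not", args[0]), ("not", args[1])))
    if op == "or":                            # D3: ~(φ ↓ ψ)
        return expand(("not", ("nor", args[0], args[1])))
    if op == "imp":                           # D4: (~φ ∨ ψ)
        return expand(("or", ("not", args[0]), args[1]))
    if op == "iff":                           # D5: ((φ ⊃ ψ) . (ψ ⊃ φ))
        return expand(("and", ("imp", args[0], args[1]), ("imp", args[1], args[0])))
    raise ValueError(f"unknown connective: {op}")

def show(s):
    """Print an expanded statement in the official parenthesis notation."""
    if isinstance(s, str):
        return s
    return f"({show(s[1])} ↓ {show(s[2])})"

print(show(expand(("imp", "φ", "ψ"))))
```

Applied to the biconditional, the same routine yields a much longer expression, which illustrates why the defined signs are kept as shorthand.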

As stated earlier (§ 7), indications of grouping in an iterated 
conjunction or alternation will often be suppressed in view of the 
fact that differences in this respect are immaterial to truth value. 
Let us now give this procedure the status of definitional 
abbreviation, by construing the ungrouped conjunction or alternation as 
shorthand for a conjunction or alternation grouped in some one 
arbitrary fashion. The following definitions will serve. 

D6. ⌜(φ₁ . φ₂ . φ₃)⌝ for ⌜((φ₁ . φ₂) . φ₃)⌝, 

⌜(φ₁ . φ₂ . φ₃ . φ₄)⌝ for ⌜(((φ₁ . φ₂) . φ₃) . φ₄)⌝, etc. 

D7. ⌜(φ₁ ∨ φ₂ ∨ φ₃)⌝ for ⌜((φ₁ ∨ φ₂) ∨ φ₃)⌝, 

⌜(φ₁ ∨ φ₂ ∨ φ₃ ∨ φ₄)⌝ for ⌜(((φ₁ ∨ φ₂) ∨ φ₃) ∨ φ₄)⌝, etc. 

Over the shorthand introduced by D1-7, further shorthand is 
superimposed by the technique of using dots instead of parentheses 
(§ 7). That technique will not be formulated in a numbered 
definition, but it has the same status. 

Joint denial is not the only truth-functional mode of composition 
which is adequate to definition of all the rest. The same is true of 
alternative denial, which is described by the following truth table: 

φ   ψ   ⌜(φ | ψ)⌝ 

T   T       F 
F   T       T 
T   F       T 
F   F       T 

Just as ⌜(φ ↓ ψ)⌝ is equivalent to ⌜(~φ . ~ψ)⌝ and to ⌜~(φ ∨ ψ)⌝, 
so ⌜(φ | ψ)⌝ is equivalent to ⌜(~φ ∨ ~ψ)⌝ and to ⌜~(φ . ψ)⌝. 
Now denial, conjunction, and alternation can all be translated into 
terms of this one mode of composition, as follows:¹ ⌜~φ⌝ as 
⌜(φ | φ)⌝, ⌜(φ . ψ)⌝ as ⌜((φ | ψ) | (φ | ψ))⌝, and ⌜(φ ∨ ψ)⌝ as 
⌜((φ | φ) | (ψ | ψ))⌝. This can be checked by tables just as was 
done in the case of joint denial. Instead of adopting joint denial 
as our primitive mode of composition, then, we might just as well 
have chosen alternative denial. D1 would then give way to: 

D1′. ⌜~φ⌝ for ⌜(φ | φ)⌝. 

Similarly ⌜(φ . ψ)⌝ and ⌜(φ ∨ ψ)⌝ would be introduced as 
abbreviations of ⌜((φ | ψ) | (φ | ψ))⌝ and ⌜((φ | φ) | (ψ | ψ))⌝. Simplified 
by application of the abbreviation D1′, these definitions appear as 
follows: 

D2′. ⌜(φ . ψ)⌝ for ⌜~(φ | ψ)⌝, 

D3′. ⌜(φ ∨ ψ)⌝ for ⌜(~φ | ~ψ)⌝. 

Choice between this course and the previous one is a matter of 
indifference. But there is no third course affording equal economy; 
it can be shown that the truth-functional modes of composition are 
not all reducible to any binary one other than joint and alternative 
denial.² 
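
The uniqueness claim can be checked by exhaustive search over the sixteen binary modes. The sketch below is not the proof referred to in the footnote; it is a brute-force illustration, with `closure_size` and `adequate` as hypothetical names. For each binary mode it iterates that mode on two components φ and ψ and counts how many of the sixteen two-component truth functions are reached; a mode is adequate just in case all sixteen are.

```python
from itertools import product

ROWS = list(product([True, False], repeat=2))      # the four cases for two components

def closure_size(table):
    """Number of two-component truth functions obtainable by iterating one binary mode."""
    op = dict(zip(ROWS, table))                     # the mode as a lookup table
    p = tuple(row[0] for row in ROWS)               # column of truth values for φ
    q = tuple(row[1] for row in ROWS)               # column of truth values for ψ
    found = {p, q}
    while True:
        new = {tuple(op[(a, b)] for a, b in zip(x, y)) for x in found for y in found}
        if new <= found:                            # nothing further can be generated
            return len(found)
        found |= new

adequate = [table for table in product([True, False], repeat=4)
            if closure_size(table) == 16]
for table in adequate:
    print(dict(zip(ROWS, table)))
```

Only two tables survive the search: the one true solely when both components are false (joint denial) and the one false solely when both are true (alternative denial).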

1 The definability of denial, conjunction, and alternation in terms of joint denial 
was first pointed out by Sheffer in 1913; and similarly for alternative denial. The 
adequacy of joint denial was known to Peirce in 1880, and both facts were known to 
him in 1902; but his notes on the subject remained unpublished until 1933 (4.12, 
4.264). 

2 Cf. Zylinski. 

§10. Tautology 

THUS FAR in our study of the truth-functional modes of composition 
we have devoted our attention to concepts and notations. 
We turn now to principles, logical truths. The study of the logical 
truths at this level proves to be a trivial one, and will not detain 
us long. 

Statements which are true by virtue solely of the truth-functional 
modes of composition will be called tautologous; they involve only 
the truth-functional notations essentially, all else vacuously (cf. 
Introduction). If we think of all such notations as merely 
abbreviating iterated joint denials, according to D1-5 (§ 9), then we 
may describe a tautologous statement as one which involves only 
‘↓’ (better, ‘( ↓ )’) essentially. All tautologous statements 
are logically true, but not all logically true statements are 
tautologous. The method of showing a statement to be tautologous 
consists merely of constructing a table under it in the usual way 
(§ 3) and observing that the column under the main connective is 
composed entirely of ‘T’s. When this is the case we know that the 
statement is true by virtue solely of its truth-functional structure 
and independently of the ultimate constituent statements. 

For example, where φ, ψ, and χ are any statements, ⌜φ ⊃ ψ . ψ ⊃ 
χ .∨. χ ⊃ φ⌝ is found to be tautologous by constructing Table 1 and 
observing its eighth column; ⌜φ ⊃ ψ .⊃ φ :≡ φ⌝ is found tautologous 
by constructing Table 2 and observing its sixth column; ⌜φ ∨ ~φ⌝ 
is found tautologous by constructing Table 3 and observing its 
second column; and ⌜~(φ . ~φ)⌝ is found tautologous by 
constructing Table 4 and observing its first column. 

⌜φ  ⊃  ψ  .  ψ  ⊃  χ  .∨.  χ  ⊃  φ⌝ 
 T  T  T  T  T  T  T   T   T  T  T 
 F  T  T  T  T  T  T   T   T  F  F 
 T  F  F  F  F  T  T   T   T  T  T 
 F  T  F  T  F  T  T   T   T  F  F 
 T  T  T  F  T  F  F   T   F  T  T 
 F  T  T  F  T  F  F   T   F  T  F 
 T  F  F  F  F  T  F   T   F  T  T 
 F  T  F  T  F  T  F   T   F  T  F 

TABLE 1 

⌜φ  ⊃  ψ  .⊃  φ  :≡  φ⌝ 
 T  T  T   T  T   T  T 
 F  T  T   F  F   T  F 
 T  F  F   T  T   T  T 
 F  T  F   F  F   T  F 

TABLE 2 

⌜φ  ∨  ~  φ⌝ 
 T  T  F  T 
 F  T  T  F 

TABLE 3 

⌜~  (φ  .  ~  φ)⌝ 
 T   T  F  F  T 
 T   F  F  T  F 

TABLE 4 

⌜φ ∨ ~φ⌝ illustrates the law of the excluded middle, which is 
commonly phrased as saying that every statement is true or false; and 
⌜~(φ . ~φ)⌝ illustrates the law of contradiction: no statement is 
both true and false. These two laws are not to be identified with 
⌜φ ∨ ~φ⌝ and ⌜~(φ . ~φ)⌝ themselves; the latter are 
unspecified statements pending specification of φ, and they become 
the minor truths ‘Jones is ill ∨ ~ Jones is ill’ and ‘~(Jones is 
ill . ~ Jones is ill)’ when φ is specified as ‘Jones is ill’. But the 
law of the excluded middle may be formulated as saying that 
⌜φ ∨ ~φ⌝ is true for every statement φ, and the law of 
contradiction correspondingly. 

For any statement φ, the compounds ⌜φ ∨ ~φ⌝ and ⌜~(φ . 
~φ)⌝ are obviously and trivially true. Many longer tautologous 
statements are equally obvious, e.g. 

⌜φ ≡ ψ .≡. ψ ≡ φ⌝, ⌜φ ⊃ ψ . ψ ⊃ χ .⊃. φ ⊃ χ⌝. 

This is not, however, characteristic of all; the ones established in 
Tables 1 and 2, e.g., are far from obvious. But if tautologous 
statements are not in general obvious, at least they are essentially 
trivial, in that they can be proved by the simple mechanical 
expedient of truth tables. 
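
How mechanical the expedient is can be seen from a few lines of Python. The sketch is illustrative only: statement forms are represented as Python functions of truth values, and the names `imp`, `iff`, and `is_tautologous` are assumptions introduced here.

```python
from itertools import product

def imp(p, q):
    """Conditional: false only when p is true and q false."""
    return (not p) or q

def iff(p, q):
    """Biconditional: true when the components agree."""
    return p == q

def is_tautologous(form, n):
    """True if the main column of the table for `form` is 'T' in all 2**n cases."""
    return all(form(*row) for row in product([True, False], repeat=n))

table_1 = lambda f, s, c: (imp(f, s) and imp(s, c)) or imp(c, f)
table_2 = lambda f, s: iff(imp(imp(f, s), f), f)
table_3 = lambda f: f or (not f)
table_4 = lambda f: not (f and (not f))

for name, form, n in [("Table 1", table_1, 3), ("Table 2", table_2, 2),
                      ("Table 3", table_3, 1), ("Table 4", table_4, 1)]:
    print(name, is_tautologous(form, n))
```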

When the truth tables produce a solid column of ‘T’s under the 
main connective of a statement, we know the statement is 
tautologous. But note that the appearance of an ‘F’ in the column is not, 
conversely, a proof that the statement is not tautologous. A table 
for ⌜χ ⊃ φ .≡ φ⌝, e.g., indicates falsehood in the fourth case: 

⌜χ  ⊃  φ  .≡  φ⌝ 
 F  T  F   F  F 

Yet, as Table 2 shows, ⌜χ ⊃ φ .≡ φ⌝ will still be tautologous if χ 
happens to be ⌜φ ⊃ ψ⌝. In general, though the truth-table test may 
give a negative result when a statement is analyzed into 
components φ₁, φ₂, . . . , φₙ, analysis into shorter components φ′₁, φ′₂, 
. . . , φ′ₙ′ may still show the statement to be tautologous. The 
occurrence of an ‘F’ in the column under the main connective does 
not disprove that a statement is tautologous, unless we know that 
we have analyzed the statement into its ultimate truth-functional 
components and thus laid bare the whole of its truth-functional 
structure. 
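
The point about depth of analysis can be made concrete. In the following sketch, which is illustrative and uses the hypothetical names `main_column`, `coarse`, and `fine`, the same form is tabulated twice: once with χ left unanalyzed, and once with χ analyzed as the conditional of φ and ψ.

```python
from itertools import product

def imp(p, q):
    """Conditional: false only when p is true and q false."""
    return (not p) or q

def main_column(form, n):
    """The column under the main connective, one entry per case."""
    return [form(*row) for row in product([True, False], repeat=n)]

coarse = lambda chi, phi: imp(chi, phi) == phi          # χ treated as a component
fine = lambda phi, psi: imp(imp(phi, psi), phi) == phi  # χ analyzed as φ ⊃ ψ

print(main_column(coarse, 2))
print(main_column(fine, 2))
```

The coarse analysis shows a falsehood in one case, while the finer analysis shows truth throughout; only the latter result is conclusive.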

Precise formulation of this notion of ultimate truth-functional 
components is facilitated by taking joint denial as our primitive 
truth-functional mode of statement composition and thinking of 
‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’ as merely an unofficial shorthand 
eliminable by D1-5.¹ The immediate truth-functional components 
of φ, now, are to be understood as the ψ and χ (if such there be) 
whose joint denial ⌜(ψ ↓ χ)⌝ is φ. The truth-functional components 
of φ, more generally, are to be understood as comprising φ itself, 
together with its immediate truth-functional components (if any), 
together with the immediate truth-functional components of those 
immediate truth-functional components (if any), and so on. The 
ultimate truth-functional components of φ, finally, are those 
truth-functional components of φ which are not joint denials in 
turn. 

Now a tautologous statement is describable as one which proves 
true by the truth-table method under every assignment of truth 
values to its ultimate truth-functional components. This 
description of tautology is more explicit and mechanical than the 
earlier description in terms of essential occurrence, but the 
truth-table method to which it refers could do with more explicit 
characterization in turn. 

As we have seen, there are 2^n possible distributions of truth 
values to the ultimate truth-functional components φ₁, φ₂, . . . , φₙ 
of φ; the first distribution marks all as true, the second marks φ₁ 
false and the rest true, the third marks φ₂ false and the rest true, 
the fourth marks φ₁ and φ₂ false and the rest true, and so on. 
Consultation of the truth tables of ‘↓’, ‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’ 
enables us, under each one of these 2^n distributions, to derive truth 
values step by step for more and more complex truth-functional 
components of φ until finally φ itself is marked as true or as false. 
For each i from 1 to 2^n, thus, there is a definite set of 
truth-functional components of φ which will be marked true; let us call this 
set the ith truth set of φ. It comprises those of φ₁, φ₂, . . . , φₙ which 
are marked true under the ith distribution of truth values, together 
with all those more complex components of φ which come 
derivatively to be marked true by consultation of the tables of ‘↓’, 
‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’. Described in terms of the tabular 
constructions, as of Tables 1-4, the ith truth set comprises those 
statements which are marked ‘T’ in the ith row. The third row of 
Table 2 shows, e.g., that the third truth set of ⌜φ ⊃ ψ .⊃ φ :≡ φ⌝ 
embraces φ, ⌜φ ⊃ ψ .⊃ φ⌝, and ⌜φ ⊃ ψ .⊃ φ :≡ φ⌝ (if we suppose that 
φ and ψ here are the ultimate truth-functional components). 

1 The ensuing developments would proceed analogously if alternative denial 
were taken as primitive instead of joint denial. 

Towards a more rigorous characterization of this notion of truth 
set, let us take ‘↓’ again as our primitive connective and think of 
‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’ as mere shorthand. Now a truth set 
S of φ will include, to begin with, anywhere from all to none of the 
ultimate truth-functional components φ₁, φ₂, . . . , φₙ of φ (depending 
on whether S is the first, second, . . . , or 2^n-th truth set). From 
among the truth-functional components of φ having the form 
⌜φᵢ ↓ φⱼ⌝, next, we decide which ones accrue to S by consulting the 
joint-denial table: 

⌜φᵢ  ↓  φⱼ⌝ 
 T   F  T 
 F   F  T 
 T   F  F 
 F   T  F 

In view of this table, we accord ⌜φᵢ ↓ φⱼ⌝ to S if and only if S 
includes neither φᵢ nor φⱼ. Similarly for the succeeding levels of 
complexity: any truth-functional component ⌜(φᵢ ↓ φⱼ) ↓ φₖ⌝ or 
⌜φₖ ↓ (φᵢ ↓ φⱼ)⌝ of φ will belong to S if and only if neither of 
the parts ⌜φᵢ ↓ φⱼ⌝ and φₖ belongs to S; likewise any 
truth-functional component ⌜(φᵢ ↓ φⱼ) ↓ (φₖ ↓ φₗ)⌝ of φ will belong to S if and 
only if neither of the parts ⌜(φᵢ ↓ φⱼ)⌝ and ⌜(φₖ ↓ φₗ)⌝ belongs to S; 
and so on. In general, (I) any joint denial ⌜ψ ↓ χ⌝ which is a 
truth-functional component of φ will belong to S if and only if neither ψ nor 
χ belongs to S. From among the ultimate truth-functional 
components φ₁, φ₂, . . . , φₙ of φ, a truth set S may contain any arbitrary 
selection (there being a different truth set for each such selection); 
but once this part of S is fixed, the rest of the membership of S is 
fixed by (I). Thus a truth set of φ is describable simply as any set S 
of truth-functional components of φ which conforms to (I). 

To say that φ proves true by the truth-table method under each 
assignment of truth values to its ultimate truth-functional 
components is merely to say that φ belongs to each of its truth sets, 
since the successive truth sets of φ comprise those components of φ 
which are marked ‘T’ in the successive rows of the truth table. 
We thus arrive at the following formulation of the notion of 
tautology: φ is tautologous if and only if it belongs to each of its truth 
sets. 
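
The formulation in terms of truth sets lends itself to direct computation. The sketch below is an illustration under assumptions of its own: a statement in primitive notation is a nested tuple ("nor", ψ, χ) or a string for an ultimate component, and the names `components`, `depth`, `truth_sets`, and `is_tautologous` are hypothetical. Each truth set is built from a selection of ultimate components and then completed by condition (I).

```python
from itertools import product

def components(s):
    """All truth-functional components of s, including s itself."""
    if isinstance(s, str):
        return {s}
    return {s} | components(s[1]) | components(s[2])

def depth(s):
    """Nesting depth, used to treat simpler joint denials before more complex ones."""
    return 0 if isinstance(s, str) else 1 + max(depth(s[1]), depth(s[2]))

def truth_sets(s):
    """Every set S of components of s which conforms to condition (I)."""
    parts = components(s)
    ultimate = sorted(c for c in parts if isinstance(c, str))
    compound = sorted((c for c in parts if not isinstance(c, str)), key=depth)
    result = []
    for choice in product([True, False], repeat=len(ultimate)):
        S = {u for u, chosen in zip(ultimate, choice) if chosen}
        for c in compound:
            if c[1] not in S and c[2] not in S:   # condition (I)
                S.add(c)
        result.append(S)
    return result

def is_tautologous(s):
    """A statement is tautologous if and only if it belongs to each of its truth sets."""
    return all(s in S for S in truth_sets(s))

not_phi = ("nor", "φ", "φ")                 # ~φ, by D1
inner = ("nor", "φ", not_phi)               # (φ ↓ ~φ)
excluded_middle = ("nor", inner, inner)     # φ ∨ ~φ, by D3 and D1
print(is_tautologous(excluded_middle))
print(is_tautologous(not_phi))
```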

When a statement is written out in full, with all the 
abbreviations ‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’ eliminated in favor of ‘↓’, the 
writing of rows of ‘T’s and ‘F’s under the statement in usual 
truth-table fashion is merely a way of recording the successive truth sets. 
Thus it is that solid ‘T’s, in the column corresponding to the whole 
statement, establish tautology. In practice, of course, we do not 
eliminate the abbreviations ‘~’, ‘.’, ‘∨’, ‘⊃’, and ‘≡’; in 
constructing the truth table of a given statement we thus consult not 
only the table of ‘↓’ but also those of ‘~’, ‘.’, etc. This is just a 
shortcut, based on the consideration that the tables of ‘~’, ‘.’, 
etc. record the net results of expanding ‘~’, ‘.’, etc. into terms of 
‘↓’ and applying the table of ‘↓’ (cf. § 9). 

Often a statement is not given explicitly, but is described only 
incompletely in terms of unspecified components designated by 
Greek letters ‘φ’, ‘ψ’, etc. Such is the situation confronting us in 
Tables 1-4. Under these circumstances we do not know whether 
φ, ψ, etc. are ultimate components; consequently, as remarked 
earlier, the presence of an ‘F’ in the main column of the table will 
not rule out the possibility that the statement is tautologous. It is 
important to note, however, that the opposite sort of reasoning 
does hold: when the main column exhibits ‘T’ throughout we can 
conclude that the statement is tautologous regardless of whether 
the Greek letters refer to ultimate components. Consider, e.g., 
Table 1. The ‘T’ in the first row and main column shows that the 
whole statement will belong to all truth sets which contain φ, ψ, 
and χ; the ‘T’ in the second row and main column shows that the 
whole statement will belong to all truth sets which lack φ and 
contain ψ and χ; and so on. Table 1 shows therefore that the whole 
statement belongs to all its truth sets, regardless of what the nature 
and number of its ultimate truth-functional components may be. 
The table shows that any statement of the form ⌜φ ⊃ ψ . ψ ⊃ χ .∨. 
χ ⊃ φ⌝ is tautologous, regardless of the complexity of φ, ψ, and χ. 

The term ‘tautology’ is taken from Wittgenstein. The present notion of 
tautologous statements, as those true by virtue solely of truth-functional composition, 
seems to agree with his usage; he contrives to make the term cover truths which 
involve also quantification, but this is consequent only upon an effort to explain 
quantification as a sort of infinite mode of truth-functional composition. A broader 
use of the term ‘tautologous’ has arisen in subsequent literature, because of Witt¬ 
genstein’s doctrine that all mathematics and logic is tautologous. This doctrine 
was intended by Wittgenstein as a thesis, not as a definition of tautology; and in¬ 
deed it is a difficult thesis to defend. But some who do not maintain the thesis in 
any such form, and who regard the inferences of logic and mathematics as “merely 
verbal transformations” or “disguised repetitions” only in some much broader 
sense, have been led thus to transfer the term ‘tautologous’ to this broader sense. 
It is not clear just what this broader sense is (cf. my “Truth by Convention”); 
but, whatever it is, there is already a term of long standing ready at hand for it — 
Kant’s term ‘analytic’. Hence, following a suggestion of Carnap’s, I am confining 
the term ‘tautologous’ to the narrower sense — though in abstraction from Witt¬ 
genstein’s theories. 

A rule amounting substantially to the tabular test of tautology was set forth by 
Peirce (3.387 f), and is indeed implicit in Boole’s general rule of development. 
The rigorous formulation of tautology presented above is new. Another formulation, 
equally strict but more complex, has been offered by Tarski (“Untersuchungen,”
Def. 4).


§11. Selected Tautologous Forms 

THERE IS no end to the variety of ways in which statements can
be compounded by truth-functional connectives to produce
tautology. From among this infinity of forms, certain samples were
considered in the preceding section. Further samples, all of them
conditionals or biconditionals, will now be recorded and
commented upon.

Where φ is any statement, the statements

(1) ⌜φ ≡ φ⌝,

(2) ⌜φ ≡ ~ ~ φ⌝,

(3) ⌜φ ≡. φ . φ⌝,

(4) ⌜φ ≡. φ v φ⌝

are tautologous.¹ This can be verified by constructing four two-row
tables, in the manner of Tables 3 and 4 of § 10. The correlative
conditionals ⌜φ ⊃ φ⌝, ⌜φ ⊃ ~ ~ φ⌝, ⌜~ ~ φ ⊃ φ⌝, etc. are of course
likewise tautologous.

A binary mode of statement composition, rendered say by the
connective κ, is said to be reflexive if ⌜φ κ φ⌝ is true for all
statements φ, and idempotent if ⌜φ ≡. φ κ φ⌝ is true for all statements
φ. The tautology of ⌜φ ≡ φ⌝ and ⌜φ ⊃ φ⌝ thus shows the reflexivity
of the biconditional and the conditional; and (3) and (4) reflect the
idempotence of conjunction and alternation. The law of double
denial, according to which consecutive denial signs cancel, is
reflected in (2).

The property of tautology provides derivative relations of
tautologous implication and equivalence which are narrower than
logical implication and equivalence (cf. § 5). φ may be said to
imply ψ tautologously when ⌜φ ⊃ ψ⌝ is tautologous; and φ may be
said to be tautologously equivalent to ψ when ⌜φ ≡ ψ⌝ is
tautologous. The antecedent of a tautologous conditional tautologously
implies the consequent, and the two sides of a tautologous
biconditional are tautologously equivalent. From the tautologous forms
listed in (1)-(4), thus, we see that any statement is tautologously
equivalent to itself, to its double denial, to its self-conjunction, and
to its self-alternation.
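
Tautologous implication and equivalence reduce to the tabular test applied to a conditional or a biconditional. A small sketch in Python follows; the function names `taut_implies` and `taut_equivalent`, and the representation of statement forms as Python functions of truth values, are assumptions made for illustration only.

```python
from itertools import product

def is_tautology(form, n):
    # 'T' in every row of the table for n component letters.
    return all(form(*row) for row in product([True, False], repeat=n))

def taut_implies(f, g, n):
    """f tautologously implies g when the conditional of f and g is tautologous."""
    return is_tautology(lambda *v: (not f(*v)) or g(*v), n)

def taut_equivalent(f, g, n):
    """f is tautologously equivalent to g when their biconditional is tautologous."""
    return is_tautology(lambda *v: f(*v) == g(*v), n)

phi = lambda p: p
forms = {
    "itself (1)":           lambda p: p,
    "double denial (2)":    lambda p: not (not p),
    "self-conjunction (3)": lambda p: p and p,
    "self-alternation (4)": lambda p: p or p,
}
for name, f in forms.items():
    print(name, taut_equivalent(phi, f, 1))   # True in each case
```

The same helpers show that the relation of implication is one-directional: a conjunction tautologously implies a component, but not conversely.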

Where φ and ψ are any statements, the conditionals

(5) ⌜φ . ψ .⊃ φ⌝,

(6) ⌜φ ⊃. φ v ψ⌝

are tautologous. This can be verified by construction of four-row
tables, in the manner of Table 2 of § 10. Conjunctions tautologously
imply their components, and alternations are tautologously
implied by them.

¹ Note incidentally that (2)-(4) are merely different abbreviations, by D1-3, of
one and the same form:

⌜φ ≡ ((φ ↓ φ) ↓ (φ ↓ φ))⌝.

A mode of statement composition is said to be commutative if
⌜φ κ ψ .≡. ψ κ φ⌝ is true for all statements φ and ψ. Conjunction,
alternation, and the biconditional are commutative; for, where φ
and ψ are any statements, construction of tables shows that

(7) ⌜φ . ψ .≡. ψ . φ⌝,

(8) ⌜φ v ψ .≡. ψ v φ⌝,

(9) ⌜φ ≡ ψ .≡. ψ ≡ φ⌝

are tautologous.

De Morgan’s law, according to which a denied conjunction is
equivalent to the alternation of the denials and a denied alternation
is equivalent to the conjunction of the denials, is reflected in the
following two forms of tautology.

(10) ⌜~ (φ . ψ) ≡. ~ φ v ~ ψ⌝,

(11) ⌜~ (φ v ψ) ≡. ~ φ . ~ ψ⌝.

In view of D4, (11) has:

(12) ⌜~ (φ ⊃ ψ) ≡. φ . ~ ψ⌝

as a minor variant. The three forms (10)-(12) show how to break
up a denied conjunction, alternation, or conditional. The
tautologous form

(13) ⌜~ (φ ≡ ψ) ≡. φ ≡ ~ ψ⌝

does the same for the biconditional.
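
The four ways of breaking up a denied compound can be checked by four-row tables. The sketch below, in Python, takes (10)-(13) in the readings which the prose describes; the helper names `imp` and `iff` are illustrative assumptions.

```python
from itertools import product

def is_tautology(form, n):
    return all(form(*row) for row in product([True, False], repeat=n))

imp = lambda a, b: (not a) or b   # conditional
iff = lambda a, b: a == b         # biconditional

denied_forms = {
    10: lambda p, q: iff(not (p and q), (not p) or (not q)),   # denied conjunction
    11: lambda p, q: iff(not (p or q), (not p) and (not q)),   # denied alternation
    12: lambda p, q: iff(not imp(p, q), p and (not q)),        # denied conditional
    13: lambda p, q: iff(not iff(p, q), iff(p, not q)),        # denied biconditional
}
results = {number: is_tautology(form, 2) for number, form in denied_forms.items()}
print(results)   # {10: True, 11: True, 12: True, 13: True}
```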

The law of transposition, according to which the antecedent and
consequent of a conditional may be switched provided that each is
changed to the extent of attaching or dropping an initial denial
sign, is reflected in the tautologous forms

(14) ⌜φ ⊃ ψ .≡. ~ ψ ⊃ ~ φ⌝,

(15) ⌜~ φ ⊃ ψ .≡. ~ ψ ⊃ φ⌝,

(16) ⌜φ ⊃ ~ ψ .≡. ψ ⊃ ~ φ⌝.

Note that (16) is merely a case of (8), in view of D4.

There are related tautologous forms for the biconditional:

(17) ⌜φ ≡ ψ .≡. ~ φ ≡ ~ ψ⌝,

(18) ⌜φ ≡ ~ ψ .≡. ~ φ ≡ ψ⌝.

From (18) and (13) we see that the position of a denial sign in a
biconditional is immaterial; it may apply to either component or
to the whole with no difference of effect.

The next four tautologous forms show ways of transforming a
conjunction, alternation, or conditional into a biconditional.

(19) ⌜φ . ψ .≡: φ ≡. φ ⊃ ψ⌝,

(20) ⌜φ v ψ .≡: ψ ≡. φ ⊃ ψ⌝,

(21) ⌜φ ⊃ ψ .≡: φ ≡. φ . ψ⌝,

(22) ⌜φ ⊃ ψ .≡: ψ ≡. φ v ψ⌝.

The laws of development, whereby a statement φ can be
elaborated into an equivalent which incorporates any desired second
statement ψ, are depicted in the tautologous forms

(23) ⌜φ ≡. φ v ψ . φ v ~ ψ⌝,

(24) ⌜φ ≡: φ . ψ .v. φ . ~ ψ⌝.

A mode of statement composition is said to be transitive if

⌜φ κ ψ . ψ κ χ .⊃. φ κ χ⌝

is true for all statements φ, ψ, and χ. Conjunction, the conditional,
and the biconditional are transitive; for, where φ, ψ, and χ are any
statements, construction of tables shows that

(25) ⌜φ . ψ : ψ . χ :⊃. φ . χ⌝,

(26) ⌜φ ⊃ ψ . ψ ⊃ χ .⊃. φ ⊃ χ⌝,

(27) ⌜φ ≡ ψ . ψ ≡ χ .⊃. φ ≡ χ⌝

are tautologous. The tables wanted here will have eight rows
apiece, in the manner of Table 1 of § 10. Note, however, that the
labor of an eight-row table can often be reduced by a judicious
approach. Consider, e.g., (26). Any chance of falsifying this
conditional would require falsifying the consequent ⌜φ ⊃ χ⌝ and hence
assigning ‘T’ to φ and ‘F’ to χ; consequently our construction can
be limited to two rows, corresponding to the assignment of ‘T’
and ‘F’ to the remaining component ψ:

⌜φ ⊃ ψ . ψ ⊃ χ .⊃. φ ⊃ χ⌝
 T T T F T F F  T  T F F
 T F F F F T F  T  T F F

The tautologous forms

(28) ⌜φ ⊃ ψ .⊃: φ . χ .⊃. ψ . χ⌝,

(29) ⌜φ ⊃ ψ .⊃: φ v χ .⊃. ψ v χ⌝

show that a true conditional remains true when antecedent and
consequent are elaborated uniformly by conjunction or alternation.
The tautologous forms

(30) ⌜φ ⊃. ψ ⊃ χ :≡: ψ ⊃. φ ⊃ χ⌝,

(31) ⌜φ ⊃. ψ ⊃ χ :≡: φ . ψ .⊃ χ⌝

show two useful ways of transforming an iterated conditional. A
third way appears in (44), below.
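
The labor-saving device described for (26), of examining only those rows in which the consequent is falsified, can be sketched as follows in Python. The names `form26` and `critical` are illustrative assumptions; the point is only that the two critical rows coincide with the two rows of the abbreviated table.

```python
from itertools import product

imp = lambda a, b: (not a) or b

def form26(p, q, r):
    # (phi > psi . psi > chi) > (phi > chi)
    return imp(imp(p, q) and imp(q, r), imp(p, r))

# Full eight-row table.
full = [form26(*row) for row in product([True, False], repeat=3)]

# Shortcut: only rows making the consequent false could falsify the whole,
# i.e. phi true and chi false; psi is the one remaining free component.
critical = [(p, q, r) for p, q, r in product([True, False], repeat=3)
            if not imp(p, r)]
short = [form26(*row) for row in critical]

print(len(full), all(full))   # 8 True
print(critical)               # [(True, True, False), (True, False, False)]
print(all(short))             # True
```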

A mode of statement composition is said to be associative if

⌜(φ κ ψ) κ χ .≡. φ κ (ψ κ χ)⌝

is true for all statements φ, ψ, and χ; in short, if grouping is
immaterial. Conjunction, alternation, and the biconditional are
associative; for, where φ, ψ, and χ are any statements, the
statements

(32) ⌜φ . ψ : χ :≡: φ : ψ . χ⌝,

(33) ⌜φ v ψ .v χ :≡: φ v. ψ v χ⌝,

(34) ⌜φ ≡ ψ .≡ χ :≡: φ ≡. ψ ≡ χ⌝

are tautologous. The associativity of conjunction and
alternation was observed earlier (§ 7); but the associativity of the
biconditional is perhaps surprising enough to move the reader to
work out the table for (34) himself.

In the case of conjunction and alternation, associativity
prompted a special notational convention (D6-7, § 9); but a
corresponding convention for the biconditional is inadvisable.
The notation ⌜φ ≡ ψ ≡ χ⌝ is likely to suggest, not ⌜φ ≡ ψ .≡ χ⌝
nor ⌜φ ≡. ψ ≡ χ⌝, but rather ⌜φ ≡ ψ . ψ ≡ χ⌝, on the analogy of
the continued equation ‘x = y = z’. The conditions under which
⌜φ ≡ ψ .≡ χ⌝ (or ⌜φ ≡. ψ ≡ χ⌝) is true are not those under which
⌜φ ≡ ψ . ψ ≡ χ⌝ is true; the latter is true when φ, ψ, and χ are
alike in truth value, but the former proves to be true rather when
just one or all of φ, ψ, and χ are true.
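
The contrast between the grouped biconditional and the chained reading can be confirmed row by row. The following Python sketch is illustrative only; the dictionary names are assumptions, and truth values are represented by Python booleans (so that `sum(row)` counts the true components).

```python
from itertools import product

iff = lambda a, b: a == b
rows = list(product([True, False], repeat=3))

left_grouped  = {row: iff(iff(row[0], row[1]), row[2]) for row in rows}
right_grouped = {row: iff(row[0], iff(row[1], row[2])) for row in rows}
chained       = {row: iff(row[0], row[1]) and iff(row[1], row[2]) for row in rows}

# Associativity, as in (34): the two groupings agree in every row.
print(left_grouped == right_grouped)                                    # True

# Grouped biconditional: true exactly when one or all three components are true.
print(all(left_grouped[row] == (sum(row) in (1, 3)) for row in rows))   # True

# Chained reading: true exactly when all three are alike in truth value.
print(all(chained[row] == (len(set(row)) == 1) for row in rows))        # True
```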

In an iterated conjunction, alternation, or biconditional the
order and grouping of components have been seen to be immaterial
(commutativity and associativity). In the case of conjunction and
alternation, repetition of components has likewise been seen to be
immaterial (idempotence). Any two iterated conjunctions, formed
from the same components with or without repetitions, are
equivalent (tautologously equivalent); and similarly for alternation.

But the biconditional, which lacks idempotence, happens to
behave yet more simply under repetition of components: the
repetitions cancel out in pairs, in view of the fact that

(35) ⌜φ ≡: φ ≡ ψ .≡ ψ⌝

is tautologous. Two iterated biconditionals are tautologously
equivalent whenever the components appearing an odd number of
times in each are the same. This principle is a consequence of
another equally curious one: any iterated biconditional is
tautologous if each component occurs an even number of times.¹ Further,
any compound built up by the biconditional and denial is
tautologous if each component occurs an even number of times and there
are an even number of denial signs.² Note, e.g., that (1), (2), (9),
(13), (17), (18), (34), and (35) involve φ, ψ, χ, and ‘~’ twice each
if at all.
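
The parity principle for iterated biconditionals invites an empirical check. The Python sketch below builds randomly grouped biconditionals in which every letter occurs an even number of times and tests each by the tabular method. The representation of a compound as a nested pair, and the names `random_tree` and `evaluate`, are assumptions made for illustration; a run of random trials is of course evidence only, not a proof.

```python
import random
from itertools import product

def random_tree(leaves, rng):
    """Group a list of letters into a random binary tree of biconditionals."""
    if len(leaves) == 1:
        return leaves[0]
    cut = rng.randint(1, len(leaves) - 1)
    return (random_tree(leaves[:cut], rng), random_tree(leaves[cut:], rng))

def evaluate(tree, env):
    if isinstance(tree, str):          # a component letter
        return env[tree]
    left, right = tree                 # a biconditional of two subtrees
    return evaluate(left, env) == evaluate(right, env)

def is_tautology(tree, letters):
    return all(evaluate(tree, dict(zip(letters, row)))
               for row in product([True, False], repeat=len(letters)))

rng = random.Random(0)
letters = ["p", "q", "r"]
even_ok = True
for _ in range(200):
    # Each letter occurs 0, 2, or 4 times; fall back to 'p == p' if none occur.
    leaves = [l for l in letters for _ in range(2 * rng.randint(0, 2))] or ["p", "p"]
    rng.shuffle(leaves)
    even_ok = even_ok and is_tautology(random_tree(leaves, rng), letters)

print(even_ok)                                                     # True
print(is_tautology(("p", (("p", "q"), "q")), letters))             # form (35): True
print(is_tautology(("p", ("p", "q")), letters))                    # odd 'q': False
```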

Given any binary modes of statement composition, rendered
say by the respective connectives κ and μ, the first is said to be
distributive into the second if

⌜φ κ (ψ μ χ) .≡. (φ κ ψ) μ (φ κ χ)⌝

is true for all statements φ, ψ, and χ.³ Distributivity always
justifies an operation analogous to that of “multiplying through” in
arithmetic: x(y + z) = xy + xz. Now conjunction turns out to be
distributive into itself and alternation; and alternation and the
conditional turn out to be distributive into conjunction,
alternation, the conditional, and the biconditional. Where φ, ψ, and χ are
any statements,

(36) ⌜φ . ψ . χ .≡. φ . ψ . φ . χ⌝,

(37) ⌜φ . ψ v χ .≡: φ . ψ .v. φ . χ⌝,

(38) ⌜φ v. ψ . χ :≡. φ v ψ . φ v χ⌝,

(39) ⌜φ v. ψ v χ :≡: φ v ψ .v. φ v χ⌝,

(40) ⌜φ v. ψ ⊃ χ :≡: φ v ψ .⊃. φ v χ⌝,

(41) ⌜φ v. ψ ≡ χ :≡: φ v ψ .≡. φ v χ⌝,

(42) ⌜φ ⊃. ψ . χ :≡. φ ⊃ ψ . φ ⊃ χ⌝,

(43) ⌜φ ⊃. ψ v χ :≡: φ ⊃ ψ .v. φ ⊃ χ⌝,

(44) ⌜φ ⊃. ψ ⊃ χ :≡: φ ⊃ ψ .⊃. φ ⊃ χ⌝,

(45) ⌜φ ⊃. ψ ≡ χ :≡: φ ⊃ ψ .≡. φ ⊃ χ⌝

are tautologous.

¹ Lesniewski, “Grundzüge,” pp. 26, 26.

² This was pointed out to me by Dr. J. C. C. McKinsey.

³ The first mode of composition may likewise be said to be distributive out of
the second if ⌜(φ μ ψ) κ χ .≡. (φ κ χ) μ (ψ κ χ)⌝ is true for all statements φ, ψ, and χ.
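
Which connectives are distributive into which can be settled exhaustively by eight-row tables. The following Python sketch is offered only as an illustration; the dictionary `CONNECTIVES` and the function `distributes_into` are assumed names, and the four connectives are modelled by Python's boolean operations.

```python
from itertools import product

CONNECTIVES = {
    "conjunction":   lambda a, b: a and b,
    "alternation":   lambda a, b: a or b,
    "conditional":   lambda a, b: (not a) or b,
    "biconditional": lambda a, b: a == b,
}

def distributes_into(k, m):
    """True if  phi k (psi m chi)  agrees with  (phi k psi) m (phi k chi)
    in all eight rows of the table."""
    return all(k(p, m(q, r)) == m(k(p, q), k(p, r))
               for p, q, r in product([True, False], repeat=3))

table = {k_name: [m_name for m_name, m in CONNECTIVES.items()
                  if distributes_into(k, m)]
         for k_name, k in CONNECTIVES.items()}

for name, targets in table.items():
    print(name, "->", targets)
# conjunction -> ['conjunction', 'alternation']
# alternation -> ['conjunction', 'alternation', 'conditional', 'biconditional']
# conditional -> ['conjunction', 'alternation', 'conditional', 'biconditional']
# biconditional -> []
```

The ten affirmative entries correspond to the forms (36)-(45); the biconditional, by this reckoning, is distributive into none of the four.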

Along with (42) and (43), note the further tautologous forms

(46) ⌜φ . ψ .⊃ χ :≡: φ ⊃ χ .v. ψ ⊃ χ⌝,

(47) ⌜φ v ψ .⊃ χ :≡. φ ⊃ χ . ψ ⊃ χ⌝.

In view of (42), a conditional with conjunctive consequent amounts
to a conjunction of conditionals; in view of (46), however, a
conditional with conjunctive antecedent amounts to an alternation of
conditionals. (47) contrasts with (43) in analogous fashion.

The form (45) shows one useful way of transforming ⌜φ ⊃. ψ ≡ χ⌝;
the following shows another:

(48) ⌜φ ⊃. ψ ≡ χ :≡: φ . ψ .≡. φ . χ⌝.

Continually, in subsequent developments (§§ 17 ff.), we shall 
find ourselves called upon to recognize simple forms of tautology. 
The method of truth tables is always available for the purpose; 
but since in many cases the form to be recognized will be one or 
another of the foregoing forms (1)-(48), or a minor variant thereof,
the reader will save time by familiarizing himself with these. 

Most of the tautologous forms (1)-(48) are cited, along with numerous others,
by Whitehead and Russell (*1-*5). The bulk of them were known to earlier
authors: Frege, Peano, Schröder, Peirce, Boole. Many were indeed anticipated,
as Łukasiewicz points out (“Zur Geschichte”), by the Stoics in ancient times and
by Petrus Hispanus, Duns Scotus, and others in the Middle Ages; what has come
to be known as De Morgan’s law, e.g., goes back to William of Ockham. One or
more of (19), (20), (40), and (41) may be new to the literature.

The terms ‘idempotent’, ‘commutative’, ‘associative’, and ‘distributive’ are
borrowed from abstract algebra, where their use is closely analogous to that
described above (cf. §33). The terms ‘reflexive’ and ‘transitive’ are from relation
theory (cf. §42; also §25).


§ 12. The Quantifier


THE TRIVIAL statement: 

(1) 9 is less than, equal to, or greater than 0 

is translatable into arithmetical and logical symbols thus: 

(2) 9 < 0 .v. 9 = 0 .v. 9 > 0. 

But the statement: 

(3) Every number is less than, equal to, or greater than 0, 

despite its superficial resemblance to (1), is not correspondingly 
translatable as: 

(4) Every number < 0 .v. every number = 0 .v. every number
> 0;

for (3) is true, if we suppose imaginary numbers excluded, while 
(4) is false. The alternation (4) says that some one of the three 
categories exhausts all numbers, while (3) says only that the three 
categories together exhaust all numbers. (3) calls for expansion 
rather into some such form as this: 

Whatever number you may select, it will be less than or 
equal to or greater than 0; 

or, incorporating the arithmetical and logical notation: 

(5) Whatever number you may select, it < 0 .v. it = 0 .v. it
> 0.

As a step toward further logical notation we might compress (5) 
thus: 

(6) whatever number (it < 0 .v. it = 0 .v. it > 0). 

Note, by way of contrast, that this rudimentary notation endows 
the falsehood (4) rather with the following form: 

whatever number (it < 0) .v. whatever number (it = 0) 

.v. whatever number (it > 0). 
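
The difference between (3) and (4) is a difference in the scope of ‘whatever number’ relative to the alternation, and it can be exhibited on a small scale. In the Python sketch below the finite range `numbers` is a hypothetical stand-in for the totality of numbers, chosen purely for illustration; `all` plays the part of the prefix ‘whatever number’.

```python
numbers = range(-5, 6)   # hypothetical finite sample standing in for "every number"

# (3), in the form (6): whatever number (it < 0 .v. it = 0 .v. it > 0)
statement_3 = all(n < 0 or n == 0 or n > 0 for n in numbers)

# (4): whatever number (it < 0) .v. whatever number (it = 0)
#      .v. whatever number (it > 0)
statement_4 = (all(n < 0 for n in numbers)
               or all(n == 0 for n in numbers)
               or all(n > 0 for n in numbers))

print(statement_3, statement_4)   # True False
```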

What (2) says about the number 9 is, as we have noted, true of
all numbers. This fact is expressed in (3), and again in (5) and (6).
It is expressed in (6) by putting ‘it’ for ‘9’ in (2) and applying
the prefix ‘whatever number’. But note now that what (3) says
about 0 is in turn true of every number. A statement to this
effect can be formed from (3) just as (6) was formed from (2):

(7) whatever number (every number is less than, equal to, 
or greater than it). 

Now the parenthesized part of (7), viz.: 

(8) every number is less than, equal to, or greater than it, 

still wants translation into our new notation. Since (8) differs from
(3) only in containing ‘it’ instead of ‘0’, the translation of (3) into
(6) suggests translation of (8) into:

(9) whatever number (it < it .v. it = it .v. it > it).

But this translation is inacceptable; it turns (7) into:

(10) whatever number (whatever number (it < it .v. it = it
.v. it > it)),

wherein the distinction is lost between the ‘it’s which correspond
to the outer occurrence of ‘whatever’ and the ‘it’s which
correspond to the inner occurrence of ‘whatever’.

This difficulty can be overcome by returning to (6) and (7) and
revising our basic notation, to the extent of tagging ‘whatever’ and
‘it’ with one or another arbitrary subscript to show that they
belong together. (7) and (6) might be rendered respectively:

(11) whatever₁ number (every number is less than, equal to,
or greater than it₁),

(12) whatever₂ number (it₂ < 0 .v. it₂ = 0 .v. it₂ > 0).

Now the parenthesized part of (11), viz.:

(13) every number is less than, equal to, or greater than it₁,

differs from (3) only in containing ‘it₁’ instead of ‘0’. Parallel to
the translation of (3) into (12), we can translate (13) into:

(14) whatever₂ number (it₂ < it₁ .v. it₂ = it₁ .v. it₂ > it₁),

which differs from (12) only in containing ‘it₁’ instead of ‘0’.
Thus (11) in its entirety becomes:

(15) whatever₁ number (whatever₂ number (it₂ < it₁ .v. it₂ = it₁
.v. it₂ > it₁)).

Such use of two sets of ‘it’s with distinguishing subscripts 
corresponds to the common use of ‘former’ and ‘latter’. 
(15) might be rendered thus in ordinary language: 

Whatever number you may select, it will turn out, 
whatever number you may next select, that the latter 
is less than, equal to, or greater than the former. 

In more complex cases there may be more than two distinct
subscripts; such cases are commonly handled in ordinary language,
not by the ‘former-latter’ idiom, but by the device of speaking of 
the first, second, third, etc. in an arbitrary order. It is after this 
device that the present use of numerical subscripts is patterned. 

But a more compact and convenient notation consists in using
simply different letters, instead of ‘it’ with different subscripts, and
then using the same letter in parentheses instead of ‘whatever’
with the corresponding subscript. Thus revised, (12) and (15)
appear as follows:

(16) (y) number (y < 0 .v. y = 0 .v. y > 0),

(17) (x) number ((y) number (y < x .v. y = x .v. y > x)).

Similarly ‘All men are mortal’, or ‘Every man is mortal’, becomes:

(18) (x) man (x is mortal). 

Again, ‘No man has seen every city’, or in other words ‘Every man 
is in the position of not having seen all cities’, becomes: 

(x) man ~ (x has seen all cities). 

But ‘x has seen all cities’, or ‘x has seen every city’, gives way
similarly to ‘(y) city (x has seen y)’; so the whole becomes:

(19) (x) man ~ (y) city (x has seen y). 

Such prefixes as ‘(x) number’, ‘(x) man’, ‘(y) city’ may be
read ‘no matter what number x may be’, ‘no matter what man x
may be’, ‘no matter what city y may be’. The tentative
symbolism at which we have arrived is thus a regimentation of the
familiar practice, in algebra and elsewhere, of using letters to
achieve generality. An algebraic discussion which contains ‘x’,
and is supposed to hold for every number x, can now be explained
as tacitly prefaced in the fashion ‘(x) number’. The algebraic use
of letters was characterized earlier (§ 6) in a rough, tentative way;
the letters were spoken of as names of unspecified numbers,
ambiguous names of numbers, signs replaceable by names of any
numbers. Now the status of such letters is clearer; the use of
‘x’, ‘y’, etc. in connection with prefixes ‘(x) number’, ‘(y)
number’, etc. involves nothing beyond what was just now explained in
terms of ‘whatever’ and pronouns. There are indeed uses of ‘x’
in algebra which do not lend themselves immediately to
formulation in terms of the idiom ‘(x) number’, or ‘no matter what
number x may be’; often the appropriate prefix is rather ‘there is a
number x such that’, or ‘the number x such that’, or ‘the class of
all numbers x such that’. It will be found, however (§§ 19, 26, 27),
that all such apparently divergent uses admit finally of translation
into forms involving just the idiom now under consideration.

What has been said concerning algebra applies also to the
analogous use of letters in non-numerical fields; e.g., to the use of
Greek letters in discussions of expressions. Use of ‘μ’ in saying
something about all expressions, or of ‘φ’ in saying something about
all statements, would now be explained in terms of such prefixes as
‘(μ) expression’ and ‘(φ) statement’, or better ‘(x) expression’,
‘(x) statement’. To insist on explicit reformulations of this kind
in all future discussions would, however, be as unwarranted as to
insist on elimination of the words ‘or’, ‘and’, and ‘if-then’ from
future discussions in favor of ‘v’, ‘.’, and ‘⊃’. Most things can be
explained clearly enough by the common-sense sort of statement;
it is rather when the statement itself becomes an object of analysis
that its full translation into the logical notation becomes
particularly advantageous.

The notation exemplified in (16)-(19) is only tentative, and will
now undergo further evolution. Consider the statement
‘Everything is identical with itself’. Paraphrased in the manner of (5),
this becomes:

(20) Whatever you may select, it = it.

But the limitation imposed by the word ‘number’, in (5), has no
analogue in (20). We may appropriately put (20) into symbols by
following the pattern of (16) and (18) but omitting any
range-indicator such as ‘number’, ‘man’. We have:

(21) (x)(x = x).

Just as the prefix ‘(x) number’ may be read ‘no matter what
number x may be’, so the simple prefix ‘(x)’, as of (21), may be
read simply ‘no matter what x may be’. Now a prefix such as
‘(x) number’, ‘(x) man’, etc. can be eliminated, always, in favor
of the simple kind of prefix used in (21). E.g., the statement:

(22) No matter what man x may be, x is mortal

can be paraphrased as:

(23) No matter what x may be, if x is a man then x is mortal.

Instead of limiting our assertion to men at the start, we admit all
entities at the start and then employ the conditional form to spare
the non-men. Thus (18) gives way to:

(24) (x)(x is a man .⊃. x is mortal).

In general, the form ‘(x) M (---)’ gives way to ‘(x)(x is an M
.⊃. ---)’. (16), (17), and (19) become:

(25) (y)(y is a number .⊃: y < 0 .v. y = 0 .v. y > 0),

(26) (x)(x is a number .⊃
(y)(y is a number .⊃: y < x .v. y = x .v. y > x)),

(27) (x)(x is a man .⊃ ~ (y)(y is a city .⊃. x has seen y)).

The notation (16)-(19) involving range-indicators will never be
used hereafter; the notation exemplified by (21) and (24)-(27) is
adopted as standard.
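
The elimination of range-indicators in favor of the conditional can be illustrated over a small universe. In the Python sketch below the class `Thing`, the two sample universes, and the function names are all hypothetical, invented for the illustration; the point is only that the restricted form and the conditional form agree in truth value whether the general statement is true or false.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Thing:
    name: str
    is_man: bool
    is_mortal: bool

def restricted(universe):
    # Pattern of (18): (x) man (x is mortal); only the men are consulted.
    return all(x.is_mortal for x in universe if x.is_man)

def unrestricted(universe):
    # Pattern of (24): (x)(x is a man .>. x is mortal); every entity is
    # consulted, and the conditional spares the non-men.
    return all((not x.is_man) or x.is_mortal for x in universe)

# Hypothetical universes of discourse.
universe_a = [Thing("Socrates", True, True), Thing("Plato", True, True),
              Thing("Mount Olympus", False, False)]
universe_b = universe_a + [Thing("an immortal man", True, False)]

print(restricted(universe_a), unrestricted(universe_a))   # True True
print(restricted(universe_b), unrestricted(universe_b))   # False False
```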

The prefixes ‘(x)’, ‘(y)’, etc. are called quantifiers, and their
use in forming statements is known as quantification. The letters
‘x’, ‘y’, ‘z’, etc. themselves are called variables. In order that
the letters available as variables may not be limited by the bounds
of the alphabet, further ones are formed by means of accents; thus
the variables will be understood as comprising ‘w’, ‘x’, ‘y’, ‘z’,
‘w′’, ‘x′’, ‘y′’, ‘z′’, ‘w″’, ....

Though the word ‘variable’ has been retained here out of
consideration for established usage, the “variation” connoted belongs
to a vague metaphor which is best forgotten. The variables have
no meaning beyond the pronominal sort of meaning which is
reflected in translations such as (20); they serve merely to indicate
cross-references to various positions of quantification. Such
cross-references could be made instead by curved lines or bonds; e.g., we
might render (27) thus:

( )( is a man .⊃ ~ ( )( is a city .⊃. has seen ))

[Diagram: curved lines join each quantifier to the blanks which it governs.]

and (26) thus:

( )( is a number .⊃ ( )( is a number .⊃: < .v. = .v. > )).

[Diagram: curved lines join each quantifier to the blanks which it governs.]

But these “quantificational diagrams” are too cumbersome to
recommend themselves as a practical notation; hence the use of
variables.

Quantification cuts across the vernacular use of ‘all’, ‘every’, 
‘any’, and also ‘some’, ‘a certain’, etc. (cf. §19), in such fashion 
as to clear away a baffling tangle of ambiguities and obscurities. 
One of the anomalies of the common idiom was noted at the
beginning of the section, and many further instances suggest themselves.
Consider e.g. the statements: 

(28) Smith can outrun every man on the team, 

(29) Smith can outrun any man on the team, 

(30) Smith cannot outrun every man on the team, 

(31) Smith cannot outrun any man on the team. 

Clearly (28) and (29) are equivalent, and (30) is the denial of
(28); but (31) is not correspondingly the denial of (29). Whereas
(28) and (29) are:

(32) (x)(x is a man on the team .⊃. Smith can outrun x)

and (30) is:

(33) ~ (x)(x is a man on the team .⊃. Smith can outrun x),

(31) is rather:

(34) (x)(x is a man on the team .⊃. ~ Smith can outrun x).

The faultiness of the vernacular is illustrated again in the tendency
to blur the distinction which is dealt with below in *139 (§ 19).
The device of quantification subjects this level of discourse, for the
first time, to a clear and general algorithm. It reveals the precise
connection, hitherto obscure, between general statements and
truth-functional composition; in (25) and (32)-(34), e.g., unlike
their idiomatic equivalents (3) and (28)-(31), the relevant
truth-functional structure is clearly exhibited.
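
The three quantified readings of the statements about Smith can be set side by side on invented data. In the Python sketch below the dictionary `can_outrun` is hypothetical; for brevity the universe is taken to be the team itself, so that the antecedent ‘x is a man on the team’ is satisfied throughout.

```python
# Hypothetical data: whether Smith can outrun each man on the team.
can_outrun = {"Jones": True, "Brown": False, "Green": True}
team = list(can_outrun)

s32 = all(can_outrun[x] for x in team)        # (32): statements (28) and (29)
s33 = not all(can_outrun[x] for x in team)    # (33): statement (30), denial of (32)
s34 = all(not can_outrun[x] for x in team)    # (34): statement (31)

print(s32, s33, s34)   # False True False
```

With a team which Smith can partly outrun, (33) is true and (34) false; so (31), construed as (34), is not the denial of (29).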

Frege (1879) was the first to devise a general notation of quantification, using 
auxiliary variables in the modern fashion. So important was this step that we 
might indeed look upon Frege, rather than Boole, as the founder of modern logic. 
The present notation, easier to print than Frege’s, is from Whitehead and Russell. 
The pronominal character of the variable was clear to Peano (Formulaire, 1897,
p. 26; 1901, p. 2); but it is only with the advent of combinatory logic, founded by
Schönfinkel and developed by Curry, that the role of the variable as an index of
cross-reference has received full analysis. The analysis consists in showing how
variables can be eliminated in favor of a few constant terms designating functions
of functions (or relations of relations). See Schönfinkel; also Curry’s
“Grundlagen,” “Apparent Variables,” and “Functionality,” Rosser’s “Mathematical
Logic”, and my “Reinterpretation” (which cites further papers by Curry).


§ 13. Formulae

WHEREAS ‘(x)(x is blue)’ is a false statement to the effect that
everything is blue, and ‘(x)(x = x)’ is a true statement to the
effect that everything is itself, the components ‘x is blue’ and
‘x = x’ are neither true nor false; they are not statements at all,
but statement matrices: expressions which would be statements
if they contained names instead of variables. Between ‘x = x’
and ‘x is blue’ there is, indeed, this difference: all statements
formed from ‘x = x’ by putting a name in place of ‘x’ are true,
while some statements so formed from ‘x is blue’ are false. But this
difference does not make ‘x = x’ itself true, nor ‘x is blue’ false.
Matrices are not statements, and only statements are true or false.

A matrix is not, however, meaningless, any more than an adverb
or preposition is meaningless. Matrices are fragmentary, but
meaningful in that they can serve as ingredients of genuine
statements like ‘(x)(x is blue)’ and ‘(x)(x = x)’. Since ‘(x)(x is
blue)’ and ‘(x)(x = x)’ are translatable as ‘Whatever you may
select, it is blue’ and ‘Whatever you may select, it = it’, the
matrices ‘x is blue’ and ‘x = x’ may be regarded as corresponding
to the expressions ‘it is blue’ and ‘it = it’, expressions resembling
statements in form but incapable of standing alone because of their
dangling pronouns.

Since matrices differ from statements only in exhibiting variables
in place of names, matrices are wanted in all forms of which
statements are capable. Quantification and the truth-functional modes
of composition thus cease to be methods merely of constructing
statements; they are to be regarded henceforth as methods of
constructing statements and matrices indifferently. Furthermore the
notion of tautology admits of immediate extension to matrices: a
matrix is tautologous if it has the form of a tautologous statement,
i.e., if it becomes a tautologous statement when variables are
supplanted by names. A tautologous matrix is not, like a tautologous
statement, true; still the tabular test of tautology can be applied
directly to the matrix, for the pattern on which the test turns is
unaffected by the presence of variables. The notions of tautologous
implication and equivalence, and logical implication and
equivalence, admit similarly of immediate extension to matrices.
Because of these affinities between statements and matrices, it will be
convenient henceforth to treat statements and matrices together
under a single head; they will be called formulae.

In Chapter I no attempt was made to mark out the category
of expressions known as statements. We now proceed to impose
bounds on that category; indirectly, however, through imposing
bounds on the broader category of formulae. This category will
retain a degree of relativity, for the space of the present chapter,
but is subjected to the following restriction: no formula can occur
within another except through quantification or truth-functional
composition. Instead of truth-functional composition, indeed, we
might cite joint denial alone; for conjunction, denial, and the rest
may be regarded as unofficial abbreviations after the manner of
D1-5 (§ 9).

Thus we are to think of the totality of formulae as comprising
some manner of atomic formulae (formulae containing no shorter
formulae as parts) and in addition just the expressions thence
constructible by continued application of joint denial and
quantification. In the next chapter, expressions which are to be taken as
atomic formulae will be specified; but for present developments
we may leave the class of atomic formulae still undetermined.
Relative to the atomic formulae, however, the totality of formulae
is fixed; it comprises the class of atomic formulae, together with
all results of putting formulae of that class in the blanks
of ‘(   ↓   )’ or after a quantifier, together with all results of
putting formulae of the thus increased class in the blanks of
‘(   ↓   )’ or after a quantifier, and so on. If μ and ν are
among the atomic formulae, and α and β are variables (‘w’, ‘x’,
‘y’, or ‘z’ with or without accents), then

μ, ν, ⌜(α)μ⌝, ⌜(α)ν⌝, ⌜(μ ↓ μ)⌝, ⌜(μ ↓ ν)⌝, ⌜((α)μ ↓ (α)μ)⌝,
⌜((α)μ ↓ μ)⌝, ⌜(μ ↓ (β)ν)⌝, ⌜((α)μ ↓ (α)ν)⌝, ⌜((α)μ ↓ (β)ν)⌝,
⌜(α)(μ ↓ ν)⌝, ⌜((α)(μ ↓ ν) ↓ μ)⌝, ⌜((α)(μ ↓ (β)ν) ↓ (β)ν)⌝,

etc. are among the formulae.
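
The characterization of formulae just given is inductive, and may be mirrored by a recursive data type. The Python sketch below is one possible rendering, under the fiction that ‘x = x’ and ‘x < 5’ are among the atomic formulae; the class names `Atom`, `JointDenial`, and `Quantification`, and the helpers `show` and `denial`, are assumptions of the sketch and not part of the text.

```python
from dataclasses import dataclass
from typing import Union

@dataclass(frozen=True)
class Atom:
    text: str                  # an atomic formula, left unanalysed here

@dataclass(frozen=True)
class JointDenial:
    left: "Formula"
    right: "Formula"

@dataclass(frozen=True)
class Quantification:
    variable: str              # 'w', 'x', 'y', or 'z', with or without accents
    body: "Formula"

Formula = Union[Atom, JointDenial, Quantification]

def show(f: Formula) -> str:
    """Write a formula out in the official notation."""
    if isinstance(f, Atom):
        return f.text
    if isinstance(f, JointDenial):
        return f"({show(f.left)} ↓ {show(f.right)})"
    return f"({f.variable}){show(f.body)}"

def denial(f: Formula) -> Formula:
    # Unofficial abbreviation: the denial of a formula is its joint denial with itself.
    return JointDenial(f, f)

mu, nu = Atom("x = x"), Atom("x < 5")
example = JointDenial(
    Quantification("x", JointDenial(mu, Quantification("y", nu))),
    Quantification("y", nu),
)
print(show(example))      # ((x)(x = x ↓ (y)x < 5) ↓ (y)x < 5)
print(show(denial(mu)))   # (x = x ↓ x = x)
```

Since the only constructors are joint denial and quantification, no formula can occur within another except in one of those two contexts, which is the restriction imposed above.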

So long as the atomic formulae remain unspecified, it is impossible 
to present any specific formulae by way of examples; we can only 
give partially determinate examples with help of Greek letters, 
as above. When specific examples seem wanted, however, we may 
conveniently avail ourselves of random components ‘x = x’, 
‘x < 5’, ‘3 < 5’, ‘Socrates is mortal’, etc., under the fiction that 
these are among our atomic formulae. 

In the requirement that no formula contain another except in a
context of joint denial or quantification, an extreme restriction
would seem to be imposed. Actually, however, nothing is thereby
barred which is needed anywhere in mathematics. For all
mathematical purposes it is sufficient that formulae occur in three ways:
alone, in joint denial, in quantification. Such other manners of
occurrence as may turn up in mathematical practice are reducible
to these by definitional translation. There is reason to believe,
indeed, that this is true not only of mathematics but of discourse
generally; this thesis is a controversial one, however, being an
extension of the thesis that all statement composition can be
limited to truth-functional composition (cf. §§ 1, 5).

There are at the same time some respects in which the above 
characterization of formulae is more liberal than § 12 would have 
led one to expect. It allows the attachment of any quantifier 
⌜(α)⌝ to any formula μ, whereas the explanation in § 12 covers
only the case where μ is a matrix involving α. Toleration of
⌜(α)μ⌝ where μ lacks α is actually a matter only of convenience;
⌜(α)μ⌝ is to be construed in such cases as equivalent simply to μ,
i.e., the quantifier is to be construed as vacuous. Thus the
statement:

(x)(Socrates is mortal)

is explained as equivalent simply to ‘Socrates is mortal’. The 
verbal analogue ‘Whatever you may select, Socrates is mortal’ 
(cf. (20), § 12) reflects the vacuousness of the quantifier. 

In allowing attachment of every quantifier to every formula we 
let down the bars also to superimposition of identical quantifiers, 
as here: 


(1) (x)(x is a man . (x)(x is a man .⊃. x is mortal) .⊃. x is mortal).

The question then arises: which of the two occurrences of
‘(x)’ is supposed to govern the occurrences of ‘x’ in ‘x is a man
.⊃. x is mortal’? But the usage is unambiguous if we agree that
occurrences of a variable α overlaid thus by duplicate occurrences
of ⌜(α)⌝ are to be governed always by the innermost occurrence of
⌜(α)⌝. The interior segment:

(2) (x)(x is a man .⊃. x is mortal)

of (1) is to be read, without regard to its context, in the usual
fashion ‘all men are mortal’. (1) as a whole thus becomes:

(x)(x is a man . all men are mortal .⊃. x is mortal),

or, read entirely in words, ‘Whatever you may select, if it is a man
and all men are mortal then it is mortal’. The letter ‘x’ in (2)
could be rewritten as ‘y’ without regard to the outer parts of (1).
(1) is equivalent to:

(3) (x)(x is a man . (y)(y is a man .⊃. y is mortal) .⊃. x is mortal).

It is clearest in practice to choose different letters, as in (3), but
quantification theory is greatly simplified by tolerating also the
usage exemplified by (1).¹

Strictly speaking the examples (1)-(3) are not formulae, but
abbreviations of formulae; to get the corresponding formulae we
must expand the abbreviations ‘.’ and ‘⊃’ into terms of ‘↓’
according to D1-5. In practice, however, this distinction can
conveniently be slurred over by keeping to the abbreviations and
imagining meanwhile that the full primitive notation is written
out instead. Except in the few cases where the abbreviations
themselves are the topic of discussion, remarks ostensibly applied
to abbreviated expressions are to be thought of as really applying
to the corresponding expressions in unabbreviated notation. Thus
it is that (1)-(3) and their like are treated as formulae.

Hereafter ‘φ’, ‘ψ’, ‘χ’, and their accented and subscripted
variants will be used not merely to refer to statements but to refer
to formulae generally, statements or matrices. This extension is
to be understood as applying also retroactively; thus the Greek
letters in D1-7 (§ 9), also in the explanation of the dot notation
(§ 7), and also in the formulation of tautology (§ 10, italicized
passages), are to be reread as referring to any formulae. Our
formulation of tautology now comes to characterize tautologous
statements and tautologous matrices simultaneously; and (1)-(48)
of § 11 now present forms of tautologous formulae, statements
and matrices indifferently. The formulae described in
(1)-(48) are tautologous statements when φ, ψ, and χ happen to
be chosen as statements, tautologous matrices otherwise.

This modification of our Greek-letter usage will be accompanied 
by another: ‘α’, ‘β’, ‘γ’, ‘δ’, and their accented and subscripted
variants will not be used to refer to expressions indiscriminately,
but will be limited to cases where the expressions referred to are the
specific one-letter expressions ‘w’, ‘x’, ‘y’, ‘z’, ‘w′’, etc., the
so-called variables. 


¹ The usage in question is proscribed by Hilbert and Ackermann, tacitly avoided
by Whitehead and Russell, and explicitly admitted by Frege (Grundgesetze, vol. 1,
p. 13), von Neumann (“Beweistheorie,” p. 7), and Carnap (Syntax, p. 88).


§ 14. Bondage, Freedom, Closure

THE NOTIONS of bondage, freedom, and closure, which are to be
explained in this section, are used constantly in subsequent
discussion. A given occurrence of α is said to be bound to a given
occurrence of ⌜(α)⌝ if it stands in a formula beginning with the given
occurrence of ⌜(α)⌝ and stands in no formula beginning with a later
occurrence of ⌜(α)⌝. In (1) of § 13, e.g., the first, second, and last
of the six occurrences of ‘x’ are bound to the initial occurrence of
‘(x)’, while the third, fourth, and fifth occurrences of ‘x’ are
bound to the other occurrence of ‘(x)’.

It was explained (§ 13) that an occurrence of α, if overlaid by
duplicate occurrences of ⌜(α)⌝, is to be regarded as governed by the
innermost of those occurrences and as having only an alphabetical
accident in common with the other occurrences. It is to this
innermost occurrence of ⌜(α)⌝ that the given occurrence of α is said to
be bound, under our newly introduced terminology. The occurrence
of ⌜(α)⌝ to which a given occurrence of α is bound is the
occurrence of ⌜(α)⌝ which governs the given occurrence of α.
The terminology ‘bound to’ receives graphic significance if we
recall the notation of quantificational diagrams (§ 12); that notation
presents actual bonds, in the form of curved lines, connecting
each quantifier-position with such subsequent variable-positions
as are “bound thereto” in the above sense. (The little bond which
always connects an occurrence of ⌜(α)⌝ with the occurrence of α
inside it is of course not shown.)
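
The rule that the innermost quantifier governs can be traced mechanically. The following Python sketch is offered only as an illustration under assumptions of its own: formulae are nested tuples, an atomic formula lists its variable occurrences in order, the abbreviations ‘.’ and ‘⊃’ are kept as the hypothetical tags "and" and "if" rather than expanded, and the function name `bindings` is invented for the purpose. Quantifier occurrences of the chosen variable are numbered from the left.

```python
def bindings(formula, var):
    """For each occurrence of var, left to right, give the number of the
    quantifier occurrence to which it is bound (None if it is free)."""
    result = []
    counter = [0]          # quantifier occurrences of var met so far

    def walk(f, governing):
        tag = f[0]
        if tag == "atom":
            for v in f[2]:
                if v == var:
                    result.append(governing)
        elif tag == "all":
            if f[1] == var:
                counter[0] += 1
                governing = counter[0]      # the innermost quantifier takes over
                result.append(governing)    # the occurrence inside '(x)' itself
            walk(f[2], governing)
        else:                               # binary compound: (tag, left, right)
            walk(f[1], governing)
            walk(f[2], governing)

    walk(formula, None)
    return result

man = ("atom", "x is a man", ("x",))
mortal = ("atom", "x is mortal", ("x",))
inner = ("all", "x", ("if", man, mortal))
# (1) of § 13: (x)(x is a man . (x)(x is a man .⊃. x is mortal) .⊃. x is mortal)
one = ("all", "x", ("if", ("and", man, inner), mortal))
# (x)(x = y + x) ⊃. y ≠ x
two = ("if",
       ("all", "x", ("atom", "x = y + x", ("x", "y", "x"))),
       ("atom", "y ≠ x", ("y", "x")))

print(bindings(one, "x"))   # six occurrences of 'x'
print(bindings(two, "x"))   # four occurrences of 'x'
```

For (1) of § 13 the first, second, and last occurrences come out bound to quantifier 1 and the third, fourth, and fifth to quantifier 2, in agreement with the count given above.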

In applying the criterion of bondage italicized above, we must 
think of our binary statement compounds as fitted with the outside 
parentheses which belong to them under strict notation (cf. § 7). 
Otherwise one is tempted to say e.g. that the final occurrence of
‘x’ in (1) of § 13 is bound to the second occurrence of ‘(x)’, on the
ground that the final occurrence of ‘x’ stands in a formula:

(x)(x is a man .⊃. x is mortal) .⊃. x is mortal

which begins with the occurrence of ‘(x)’ in question. Actually,
however, (1) of § 13 becomes:

(1) (x)((x is a man . (x)(x is a man .⊃. x is mortal)) ⊃. x is mortal)

on restoration of parentheses; and obviously its terminal fragment:

(x)(x is a man .⊃. x is mortal)) ⊃. x is mortal),

with unmated parentheses, is no formula. Again, one might be
tempted to say that the last occurrence of ‘x’ in:

(2) (x)(x = y + x) ⊃. y ≠ x

is bound to the occurrence of ‘(x)’, on the ground that (2) is a
formula beginning with the given occurrence of ‘(x)’; actually,
however, (2) becomes:

(3) ((x)(x = y + x) ⊃. y ≠ x)

on restoration of parentheses, and (3) does not begin with ‘(x)’.
Discernment of bondage relations does not in practice require
that we restore absent parentheses, so long as we keep in mind the
intended cleavages of our formulae. We must keep in mind, e.g.,
that (2) is not formed by applying ‘(x)’ to:

(x = y + x) ⊃. y ≠ x.

An occurrence of ψ in φ will be said to be bound in φ with respect
to α if within that occurrence of ψ there is an occurrence of α
which is bound to an occurrence of ⌜(α)⌝ lying in φ but outside ψ.
In (2), e.g., the occurrence of ‘x = y + x’ is bound with respect to
‘x’; for the occurrences of ‘x’ in ‘x = y + x’ are bound to an
occurrence of ‘(x)’ in (2) which is outside ‘x = y + x’. In (1),
each occurrence of ‘x is a man’ and ‘x is mortal’ is bound with
respect to ‘x’.

An occurrence of ψ will be said simply to be bound in φ if it is
bound in φ with respect to at least one variable. Thus the
occurrence of ‘x = y + x’ is bound in (2), whereas the occurrence of
‘y ≠ x’ is not. In terms of quantificational diagrams, bondage of
ψ in φ has this significance: there is at least one bond reaching into
ψ from somewhere in φ anterior to ψ.

Obviously an occurrence of ψ may be bound in a broad context
φ and yet not bound in a narrower context φ′; for, the bondage of
ψ in φ may depend on occurrences of ⌜(α)⌝, ⌜(β)⌝, etc. which, though
within φ, are outside φ′. Diagrammatically: the bonds reaching
into ψ from anterior parts of φ may begin anterior also to φ′. The
first occurrence of ‘x is a man’, e.g., is bound in the broad context
(1) but not in the narrower context:

(4) x is a man . (x)(x is a man .⊃. x is mortal).

The terminology ‘bound in’ will be applied not only to an occurrence
of a formula ψ, as above, but also to an occurrence of a variable,
in the following sense: an occurrence of α will be said to be
bound in φ if it is bound to an occurrence of ⌜(α)⌝ in φ. Thus the
first three occurrences of ‘x’ are bound in (2); the fourth is not.
An occurrence of α which is bound in a broad context φ need not be
bound in a narrower context φ′, since the occurrence of ⌜(α)⌝ to
which the given occurrence of α is bound may lie inside φ but outside
φ′; the second and third occurrences of ‘x’ in (2), e.g., are
bound in (2) and in ‘(x)(x = y + x)’ but not in ‘x = y + x’.

The terminology ‘bound in’ will be applied not only to an occurrence
of a formula or variable, as described, but also to the
formula or variable itself: a formula or variable will be said to be
bound in φ if an occurrence of it is bound in φ. Thus ‘x = y + x’
is bound in (2); so also is ‘x’. Variables bound in φ will be spoken
of also as bound variables of φ.

An occurrence of a formula or variable in φ is said to be free in
φ if it is not bound in φ; and a formula or variable is said to be free
in φ if an occurrence of it is free in φ. Thus ‘y ≠ x’ is free in (2);
so also are ‘x’ and ‘y’. Variables free in φ will be spoken of also as
free variables of φ.

To describe one and the same occurrence of μ as both bound and
free in φ would be, of course, a flat contradiction. On the other
hand μ itself may be both bound and free in φ; i.e., it may have
both a bound and a free occurrence in φ. E.g., ‘x is a man’ is
bound and free in (4). So is ‘x’.

It is obvious from the foregoing formulations that the following
hold in general. (i) An occurrence of ψ is bound in φ with respect
to α if and only if it contains an occurrence of α which is bound in
φ and free in ψ. (ii) An occurrence of α in φ is bound to the initial
occurrence of ⌜(α)⌝ in ⌜(α)φ⌝ if and only if it is free in φ.
The terminology which we have been developing enables us,
among other things, to render explicit the distinction between
statements and matrices. The distinction does not turn merely
on the absence or presence of variables, for ‘(x)(x is blue)’ and
‘(x)(x = x)’ are statements despite their variables; it turns rather
on the fact that in a statement every occurrence of a variable is
governed by a quantifier, while a matrix contains occurrences
which are not thus governed. Thus, given the totality of formulae,
we may describe matrices and statements respectively as those 
formulae which have free variables and those which have not. 
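
The distinction just drawn lends itself to a mechanical check. The Python sketch below is an illustration only, under the assumption (not made in the text) that formulae are nested tuples with the hypothetical tags "atom", "all", "and", and "if"; the names `free_vars`, `bound_vars`, and `is_statement` are likewise invented here.

```python
def free_vars(f):
    """Variables having at least one free occurrence in f."""
    tag = f[0]
    if tag == "atom":
        return set(f[2])
    if tag == "all":
        return free_vars(f[2]) - {f[1]}
    return free_vars(f[1]) | free_vars(f[2])      # binary compound

def bound_vars(f):
    """Variables having at least one bound occurrence in f."""
    tag = f[0]
    if tag == "atom":
        return set()
    if tag == "all":
        return bound_vars(f[2]) | {f[1]}
    return bound_vars(f[1]) | bound_vars(f[2])

def is_statement(f):
    """A statement is a formula without free variables; otherwise a matrix."""
    return not free_vars(f)

man = ("atom", "x is a man", ("x",))
mortal = ("atom", "x is mortal", ("x",))
# (4): x is a man . (x)(x is a man .⊃. x is mortal)
four = ("and", man, ("all", "x", ("if", man, mortal)))
blue = ("all", "x", ("atom", "x is blue", ("x",)))

print(free_vars(four), bound_vars(four))   # 'x' is both free and bound in (4)
print(is_statement(four), is_statement(blue))
```

The first line printed reflects the remark that a variable may be both bound and free in one and the same formula, though no single occurrence can be both.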

Every matrix can be turned into a statement, true or false, 
simply by prefixing quantifiers corresponding to all the free 
variables. The matrix ‘x < y .≡. y > x’, e.g., is turned in this
way into the true statement:

(5) (x)(y)(x < y .≡. y > x);

and the matrix ‘x < y’ into the falsehood ‘(x)(y)(x < y)’. Such
successive quantifiers apply, of course, one at a time to the matrix
which immediately follows. Thus ‘(x)(y)(x < y)’ is the result of
applying the quantifier ‘(x)’ to the matrix ‘(y)(x < y)’, which is
in turn the result of applying ‘(y)’ to ‘(x < y)’; and similarly for
(5). The strict reading of (5) is: ‘No matter what x may be, it
will be the case, no matter what y may be, that x < y if and only
if y > x.’ It is more natural, however, to read such successive
quantifiers together: ‘No matter what x and y may be, x < y if
and only if y > x.’ The order of the successive quantifiers is
obviously immaterial; (5) could have been rendered equivalently
as:

(6) (y)(x)(x < y .≡. y > x).

(5) and (6) are logically equivalent statements; so are
‘(x)(y)(x < y)’ and ‘(y)(x)(x < y)’.

If φ has n free variables, there are n! statements which can be
formed by prefixing distinct quantifiers in one or another order
corresponding to all the free variables; though all n! statements
are, as just remarked, equivalent. An arbitrary one of these
statements will be referred to as the closure of φ; the one, say, in which
the added quantifiers are applied in alphabetic order. Accordingly,
where α₁, α₂, . . . , αₙ (n ≥ 0) are in alphabetic order all the free
variables of φ, the closure of φ is ⌜(αₙ) . . . (α₂)(α₁)φ⌝; it is formed by
applying ⌜(α₁)⌝ to φ, then ⌜(α₂)⌝ to the result, and so on. The
alphabetic order of variables may for this and subsequent purposes be
fixed arbitrarily thus:

w x y z w' x' y' z' w"

Thus (6), but not (5), is the closure of ‘x < y .≡. y > x’. Again,
since ‘w’, ‘x’, and ‘z’ are in alphabetic order all the free variables
of:

(y)(w loves y .⊃. x hates z),

the closure of this matrix is the falsehood:

(7) (z)(x)(w)(y)(w loves y .⊃. x hates z).

At the same time (7) is the closure of:

(w)(y)(w loves y .⊃. x hates z)

and of:

(x)(w)(y)(w loves y .⊃. x hates z).

Also it is the closure of itself; every statement is its own closure.
On the other hand (7) is not the closure of:

w loves y .⊃. x hates z;

the closure of this is rather:

(z)(y)(x)(w)(w loves y .⊃. x hates z),

which is equivalent to (7) but is not the same statement.
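
The formation of closures can be sketched in Python as follows. This is an illustration under assumptions foreign to the text: formulae are nested tuples, accents are written as apostrophes, and the names `alphabetic_key`, `free_vars`, `closure`, and `show` are hypothetical. The rendering adds outside parentheses to binary compounds and omits the dots.

```python
SIGNS = {"if": "⊃", "iff": "≡", "and": ".", "nor": "↓"}

def alphabetic_key(var):
    """Order w x y z w' x' y' z' w'' ...: fewer accents first, then the letter."""
    return (var.count("'"), "wxyz".index(var[0]))

def free_vars(f):
    tag = f[0]
    if tag == "atom":
        return set(f[2])
    if tag == "all":
        return free_vars(f[2]) - {f[1]}
    return free_vars(f[1]) | free_vars(f[2])

def closure(f):
    """Apply (α1) first, then (α2), and so on, so that (αn) ends outermost."""
    for var in sorted(free_vars(f), key=alphabetic_key):
        f = ("all", var, f)
    return f

def show(f):
    tag = f[0]
    if tag == "atom":
        return f[1]
    if tag == "all":
        return "(" + f[1] + ")" + show(f[2])
    return "(" + show(f[1]) + " " + SIGNS[tag] + " " + show(f[2]) + ")"

lt = ("atom", "x < y", ("x", "y"))
gt = ("atom", "y > x", ("y", "x"))
loves = ("atom", "w loves y", ("w", "y"))
hates = ("atom", "x hates z", ("x", "z"))
open_matrix = ("if", loves, hates)
partly_closed = ("all", "y", open_matrix)

print(show(closure(("iff", lt, gt))))   # corresponds to (6), not (5)
print(show(closure(partly_closed)))     # corresponds to (7)
print(show(closure(open_matrix)))       # equivalent to (7) but a different statement
```

Since a statement has no free variables, `closure` returns it unchanged, which matches the remark that every statement is its own closure.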

Peano’s nomenclature for bound and free variables, carried over by Whitehead 
and Russell, was ‘apparent’ and ‘real’. The term ‘free’ was applied to variables 
by Hilbert (1922); but the full terminology of bondage and freedom elaborated 
above is, with the exception of ‘bound with respect to’, substantially von
Neumann’s (“Beweistheorie”, p. 9). The notion of closure is an adaptation from
Carnap (Syntax, p. 94).


§15. Axioms of Quantification 


OUT OF THE totality of logically true statements, one special
kind, the tautologous, has thus far been considered. Statements
of this kind are true by virtue solely of their structure in
terms of truth-functional composition; i.e., in terms of joint
denial. Now the advent of quantification opens up a wider class of
logical truths: statements which are true by virtue of their
structure in terms of joint denial and quantification. These may be
called quantificationally true. Church has shown¹ that it is
impossible to devise a general test whereby, given any statement, we can
mechanically decide whether or not it is quantificationally true.
Quantificational truth is thus more elusive than tautology; for
we have a mechanical test of tautology in the truth table.
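
The mechanical character of the truth-table test may be brought out by a Python sketch. It is illustrative only and rests on assumptions of its own: formulae are nested tuples, everything other than a joint denial (an atomic formula or a quantification) is treated as an unanalysed component, and the helpers `neg`, `alt`, and `cond` are hypothetical stand-ins for definitions of negation, alternation, and the conditional in terms of joint denial, in the manner of § 9.

```python
from itertools import product

def neg(p):
    return ("nor", p, p)            # assumed rendering of ~p as (p ↓ p)

def alt(p, q):
    return neg(("nor", p, q))       # assumed rendering of p ∨ q as ~(p ↓ q)

def cond(p, q):
    return alt(neg(p), q)           # assumed rendering of p ⊃ q as ~p ∨ q

def components(f):
    """The parts of f which are not themselves joint denials."""
    if f[0] == "nor":
        return components(f[1]) | components(f[2])
    return {f}

def value(f, assignment):
    if f[0] == "nor":
        return not (value(f[1], assignment) or value(f[2], assignment))
    return assignment[f]

def is_tautologous(f):
    """True if f comes out true under every row of its truth table."""
    units = sorted(components(f), key=repr)
    return all(value(f, dict(zip(units, row)))
               for row in product([True, False], repeat=len(units)))

red = ("atom", "x is red")
round_ = ("atom", "x is round")
print(is_tautologous(cond(red, red)))      # 'x is red .⊃. x is red'
print(is_tautologous(cond(red, round_)))
```

The test runs through 2ⁿ rows for n components and always terminates; nothing comparable is available for quantificational truth, as Church's result shows.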

It remains possible to mark out the class of quantificational
truths by appeal to a generative principle, as follows. We first
single out certain fundamental sets of quantificational truths which
are satisfactorily recognizable, either on sight or by mechanical
tests; then we describe a simple rule (modus ponens, § 16) whereby
further quantificational truths can be progressively generated. The
demarcation of quantificational truths which is thus achieved is
complete, in that every quantificational truth can in fact be
generated from the fundamental ones by some chain of applications of the
rule;² but the appropriate chain of applications may elude us in
particular cases, and herein consists the lack of a mechanical test.

The quantificational truths are but a species in turn of logical 
truths in a still more inclusive sense (§ 28). We shall eventually 
find (§ 60) that this latter category resists even the generative type
of demarcation available for quantificational truth. 

In this section the fundamental sets of quantificational truths 
alluded to above will be specified. One such set is typified by: 

(1) (x)(x is red .⊃. x is red),

which is the closure of a tautologous matrix ‘x is red .⊃. x is red’.
The quantifier is essential to the truth of this statement, for
without it we have only a matrix, neither true nor false. Tautologous
formulae are not always true, for they are not always statements;
their closures are statements, however, and true. Here we have a
new set of logical truths, more inclusive than the tautologous
statements; namely, the closures of tautologous formulae. Let us call
this set A. It includes the tautologous statements, since a
tautologous statement is a tautologous formula which is its own closure.

¹ Church, “A Note.” See also Hilbert and Bernays, vol. 2, pp. 416 ff., and Rosser,
“Informal Exposition.”

² Gödel proved this in 1930. See also Henkin. These arguments relate to other
versions of quantification theory, but can be transferred to the present one.

Next consider the statement: 

(x)(x is colored .⊃. x is extended) ⊃.
(x)(x is colored) ⊃ (x)(x is extended);

in words, ‘If everything colored is extended then if everything is
colored everything is extended.’ Any statement of this sort, i.e. any
statement of the form

(2) ⌜(α)(φ ⊃ ψ) ⊃. (α)φ ⊃ (α)ψ⌝,

will of course be true. If φ or ψ has a free variable other than α,
the whole will be a matrix rather than a statement; but still its 
closure will be true. The closures of formulae of the form indicated 
in (2) thus constitute another set of logical truths, which we may 
call B. 

Logical truths of a trivial kind, exemplified by: 

Socrates is mortal .⊃ (x)(Socrates is mortal),

issue from our decision (§ 13) to construe ⌜(α)φ⌝ as equivalent to φ
in cases where φ lacks α. In general, where φ lacks α, the closure of
⌜φ ⊃ (α)φ⌝ (and, indeed, of ⌜φ ≡ (α)φ⌝) will be true. Furthermore,
the requirement that φ “lack” α need not be taken to forbid
occurrences of α within a shorter quantification ⌜(α)ψ⌝ inside φ;
for such occurrences are bound to that inner occurrence of ⌜(α)⌝,
and are irrelevant to the initial ⌜(α)⌝ of ⌜(α)φ⌝ (cf. §§ 13, 14). The
only occurrences of α in φ which are bound to the initial ⌜(α)⌝
of ⌜(α)φ⌝ are the occurrences which are free in φ (cf. (ii), § 14);
and it is only such occurrences that need be forbidden in order that
⌜(α)φ⌝ be a case of vacuous quantification, equivalent to φ. Here,
then, is another set of logical truths: closures of formulae
⌜φ ⊃ (α)φ⌝ such that α is not free in φ. Let us call this set C.

Now consider the statements: 

(3) (x)(x is a man .⊃. x is mortal) ⊃:
Socrates is a man .⊃. Socrates is mortal,

(4) (x)(God created x) ⊃. God created Socrates,

(5) (x)(God created x) ⊃. God created God;



in words, ‘If all men are mortal then if Socrates is a man Socrates is
mortal’; ‘If God created everything then God created Socrates’;
‘If God created everything then God created himself.’ Each of
these statements is a conditional of the form ⌜(α)φ ⊃ φ′⌝ where φ′
affirms of some specific object (Socrates, God) what ⌜(α)φ⌝
affirms of all objects. In each case φ′ is like φ except for containing
some name ζ (‘Socrates’, ‘God’) where φ contains α. More
accurately, in general, φ′ is like φ except for containing ζ wherever
φ contains free occurrences of α; for it is only the free occurrences
of α in φ that are relevant to the initial ⌜(α)⌝ of ⌜(α)φ⌝. Any
conditional statement of this kind will of course be true, since
whatever can be affirmed of everything (as in ⌜(α)φ⌝) can be
affirmed of any particular object we choose to name (as in φ′).

Furthermore, the references to Socrates are obviously inessential 
to the truth of (3) and (4); corresponding conditionals are true of 
all men, indeed all entities, not just Socrates. Hence the general 
statement which is formed from (4) or from (3) by putting a 
variable for ‘Socrates’, and quantifying the whole, is likewise true: 

(6) (y)((x)(God created x) ⊃. God created y),

(7) (y)((x)(x is a man .⊃. x is mortal) ⊃: y is a man .⊃. y is mortal).

Similarly the references to God are inessential to the truth of (5)
and (6); the general statements formed from (5) and (6) by putting
a variable for ‘God’, and quantifying the wholes, are likewise
true:

(8) (w)((x)(w created x) ⊃. w created w),

(9) (w)(y)((x)(w created x) ⊃. w created y).

Whereas (3)-(5) are statements ⌜(α)φ ⊃ φ′⌝ such that φ′ is like
φ except for containing a name ζ wherever φ contains free
occurrences of α, on the other hand (6)-(9) are closures of formulae
⌜(α)φ ⊃ φ′⌝ such that φ′ is like φ except for containing a variable α′
wherever φ contains free occurrences of α. Let this latter set of
statements, exemplified by (6)-(9), be called D′. (D will come
later.) This set does not include (3)-(5). Indeed, the set
exemplified by (3)-(5) is not satisfactorily specifiable at the present stage;
for its specification depends on the notion of name, and nothing has
been said thus far to indicate what sorts of expressions are to be
classed as names. But the set exemplified by (3)-(5) can be
passed over; we shall find later (§ 27) that names are eliminable,
in that statements involving names are translatable in a way which
avoids recognition of any ultimate name-category.¹

Along with (6)-(9), the set D′ as described includes the
statement:

(10) (x)~(y)(y = x) ⊃ ~(y)(y = y);

for, (10) is the closure of ⌜(α)φ ⊃ φ′⌝ where α is ‘x’, φ is
‘~(y)(y = x)’, and φ′ is like φ except for containing ‘y’ wherever φ
contains free occurrences of ‘x’. But this reveals a defect in the
formulation of D′, for (10) is in fact false. The antecedent of the
conditional (10) says that, whatever entity may be selected, not
every entity will be identical with it. This much is true; if e.g.
the number 5 is selected then the number 1 is not identical with it,
while if any other entity is selected then the number 5 is not
identical with it. The consequent of the conditional (10), on the other
hand, is obviously false; it denies that everything is self-identical.
Thus (10) as a whole, having a true antecedent and false
consequent, is false.

The formulae whose closures were intended to belong to D' were, 
roughly speaking, of the sort: 

(x)(. . .x. . .) ⊃ . . .y. . .

where the consequent ‘. . .y. . .’ imposes upon y the condition
which ‘. . .x. . .’ imposes upon x. Any such closure will indeed
be true. But (10) is of a different sort; whereas its antecedent
says:

(x) ~(everything is identical with x),
its consequent does not say correspondingly: 

~ (everything is identical with y), 

but rather: 

~ (everything is self-identical). 

1 Names and other terms will subsequently make their formal appearance 
through certain conventions of notational abbreviation (§§ 24, 26, 27); and a 
principle covering the logical truths exemplified in (3)-(5) will then be derived 
(*231, §31). 
The source of this deviation on the part of (10) is apparent: when
the free occurrence of α (i.e. ‘x’) in φ (i.e. ‘~(y)(y = x)’) is
supplanted by α′ (i.e. ‘y’), the resulting occurrence of α′ comes
inadvertently to be appropriated to the purposes of a quantifier
⌜(α′)⌝ which was lurking inside φ. Our characterization of D′ can
be adjusted, therefore, by stipulating that the occurrences of α′ in
φ′, to which the free occurrences of α in φ give way, remain
likewise free. The revised set, which may be called D, comprises the
closures of formulae ⌜(α)φ ⊃ φ′⌝ such that φ′ is like φ except for
containing free occurrences of α′ wherever φ contains free
occurrences of α.
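
The difference between D′ and D can be made concrete in a Python sketch. It is only an illustration, on assumptions not found in the text: formulae are nested tuples, an atomic formula carries its variables as a tuple, negation is written as a joint denial of a formula with itself, and the name `substitute` is hypothetical. The function supplants the free occurrences of one variable by another and reports whether every inserted occurrence has remained free.

```python
def substitute(f, old, new, binders=frozenset()):
    """Put `new` for the free occurrences of `old` in f.
    Returns (result, ok); ok is False when some inserted occurrence of
    `new` falls under a quantifier on `new` lying inside f."""
    tag = f[0]
    if tag == "atom":
        hit = old in f[2]
        args = tuple(new if v == old else v for v in f[2])
        return ("atom", f[1], args), not (hit and new in binders)
    if tag == "all":
        if f[1] == old:             # occurrences of `old` below are not free
            return f, True
        body, ok = substitute(f[2], old, new, binders | {f[1]})
        return ("all", f[1], body), ok
    left, ok_left = substitute(f[1], old, new, binders)
    right, ok_right = substitute(f[2], old, new, binders)
    return (tag, left, right), ok_left and ok_right

# φ of (10): ~(y)(y = x), with ~p rendered as (p ↓ p)
everything_is_x = ("all", "y", ("atom", "=", ("y", "x")))
phi = ("nor", everything_is_x, everything_is_x)

captured, ok_y = substitute(phi, "x", "y")   # yields ~(y)(y = y)
safe, ok_z = substitute(phi, "x", "z")       # yields ~(y)(y = z)
print(ok_y, ok_z)
```

When the flag is false, as with ‘y’ for ‘x’ here, the conditional ⌜(α)φ ⊃ φ′⌝ belongs to the defective set D′ only; when it is true, its closure is an axiom of kind D.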

Statements of the four kinds A, B, C, and D will play a basic 
role in subsequent developments, and a general term is needed for 
referring to them. For lack of a better term, they will be called 
axioms of quantification. Thus the axioms of quantification
comprise the closures of all tautologous formulae; also the closures of
all formulae of the form

⌜(α)(φ ⊃ ψ) ⊃. (α)φ ⊃ (α)ψ⌝;

also the closures of all formulae ⌜φ ⊃ (α)φ⌝ such that α is not free
in φ; also the closures of all formulae ⌜(α)φ ⊃ φ′⌝ such that φ′ is
like φ except for containing free occurrences of α′ wherever φ
contains free occurrences of α.


§ 16. Theorems 

IT IS APPARENT, equally from the ordinary usage of ‘if-then’
and from the truth table of ‘⊃’, that ψ will be true if ⌜φ ⊃ ψ⌝ and φ
are true. The form of inference which carries us thus from
premisses ⌜φ ⊃ ψ⌝ and φ to the conclusion ψ is known traditionally as
the modus ponens; hence let us speak of ψ as the ponential of
⌜φ ⊃ ψ⌝ and φ. In general, let us call ψ a ponential of φ₁, φ₂, φ₃, . . .
if one of φ₁, φ₂, φ₃, . . . is a conditional having another of φ₁, φ₂, φ₃,
. . . as antecedent and ψ as consequent. Ponentials of true
statements are true; and, in particular, ponentials of logically true
statements are logically true.
Though the axioms of quantification comprise infinitely many
logical truths, they leave further infinities of logical truths
untouched. A broader class of logical truths is obtainable by
throwing in the ponentials of the axioms of quantification; a still
broader one is obtainable by throwing in the ponentials of the thus
supplemented class; and so on.

The axioms of quantification, together with the ponentials of 
those axioms, together with the ponentials of this further totality, 
and so on, will be referred to collectively as theorems. To say that φ
is a theorem is to say that there is a sequence of statements ψ₁, ψ₂,
. . . , ψₙ, φ each of which is either an axiom of quantification or a
ponential of earlier statements of the sequence. The theorems can
be characterized recursively as follows: (i) axioms of quantification
are theorems, and (ii) if ⌜φ ⊃ ψ⌝ and φ are theorems then so is ψ.¹

The following three statements, e.g., are theorems: 

(1) (y)((x)(y affects x) ⊃. y affects y),

(2) (y)((x)(y affects x) ⊃. y affects y) ⊃.
(y)(x)(y affects x) ⊃ (y)(y affects y),

(3) (y)(x)(y affects x) ⊃ (y)(y affects y).

(1) and (2) are axioms of quantification, of the respective kinds D
and B (cf. § 15). (3) is not an axiom of quantification, but it is a
theorem because it is the ponential of the theorems (1) and (2).
The sequence (1), (2), (3) is of the kind ψ₁, ψ₂, . . . , ψₙ, φ
described above.

¹ Any general notion which is resoluble into an infinite sequence of special cases
is said to be recursively characterized when we have explained the first case and
added a general rule describing the (i + 1)st case, for each i, in terms of the first i
cases. Here the axioms of quantification constitute the first case, and the
ponentials of theorems falling under the first i cases constitute the (i + 1)st case.

The tautologous statements form one class of logical truths; the
axioms of quantification form a broader class; and the theorems
form a still broader class. This third class differs from the
preceding two in this important respect: we have no general
procedure whereby we can test whether a given statement belongs to
the class or not. With respect to tautologous statements, such a
test is provided by the truth table; given any statement, framed
in our logical notation, we can appraise it systematically by the
tabular method and discover whether or not it is tautologous.¹
With respect to the broader class of quantificational axioms,
such a test is again forthcoming. Given a statement, we may first
determine whether it is an axiom of quantification of the kind A;
this is accomplished by dropping off any initial quantifiers, to get
the net formula φ whereof the original statement was the closure,
and then testing φ for tautology by table. If this investigation
yields a negative answer, we next determine whether the
statement is of the kind B; i.e., we observe whether φ is a conditional
whose antecedent and consequent are alike except for a distributed
quantifier as in (2) of § 15. In case of a negative answer, we
check the statement in similar fashion against the type C; then
against D.

But with respect to the class of theorems we have no such
complete test. A statement φ may be a theorem without our being able
to discover the fact; there may be a sequence ψ₁, ψ₂, . . . , ψₙ, φ of
the required kind without our being lucky enough to find it.
Failure to discover such a sequence is no evidence that φ is not a
theorem, though the chance discovery of such a sequence does show
that φ is a theorem.

With respect to the class of theorems we do have a partial test, 
but only a partial one; namely, given any sequence ψ₁, ψ₂, . . . ,
ψₙ, φ we can decide whether it is of the kind which makes φ a
theorem. We can check each statement of the sequence, from ψ₃
onward, against each pair of its predecessors to see if it is a
ponential thereof; then, reserving the ponentials, we can check each of
the remaining statements to see whether it is an axiom of
quantification. This reflects the characteristic mathematical situation;
the mathematician hits upon his proof by unregimented insight
and good fortune, but afterwards other mathematicians can check
his proof.²
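
The partial test just described can be sketched in Python. The sketch is an illustration under stated assumptions: formulae are nested tuples with the hypothetical tags "atom", "all", and "if"; the names `is_ponential` and `check_sequence` are invented here; and the test for axioms of quantification is not implemented but merely stood in for by a caller-supplied function `is_axiom`, which in the example simply takes (1) and (2) of this section on trust.

```python
def is_ponential(psi, earlier):
    """True if some conditional among `earlier` has psi as consequent
    and has its antecedent also among `earlier`."""
    return any(c[0] == "if" and c[2] == psi and c[1] in earlier
               for c in earlier)

def check_sequence(sequence, is_axiom):
    """True if each statement is an axiom or a ponential of earlier ones."""
    for i, statement in enumerate(sequence):
        if not (is_axiom(statement) or is_ponential(statement, sequence[:i])):
            return False
    return True

affects_x = ("atom", "y affects x", ("y", "x"))
affects_y = ("atom", "y affects y", ("y", "y"))
every_x = ("all", "x", affects_x)

s1 = ("all", "y", ("if", every_x, affects_y))                 # (1)
s3 = ("if", ("all", "y", every_x), ("all", "y", affects_y))   # (3)
s2 = ("if", s1, s3)                                           # (2)

def taken_as_axiom(statement):
    # Placeholder for the tests against kinds A, B, C, and D.
    return statement in (s1, s2)

print(check_sequence([s1, s2, s3], taken_as_axiom))
print(check_sequence([s2, s3], taken_as_axiom))   # (1) is missing
```

The checker decides only whether a proffered sequence is of the required kind; it gives no help in finding such a sequence when none is proffered.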

1 For this purpose it is ordinarily essential that the statement be given explicitly 
and in full, of course, rather than merely described by quasi-quotation. Forms 
exhibited in quasi-quotation are often capable of being shared by statements which 
are tautologous and statements which are not; cf. § 10. 

² Cf. Introduction; also §§ 55, 60.
Let us write ‘⊢ φ’ to mean that the closure of φ is a theorem.¹
Then the characterization of theorem embodied in (i) and (ii)
above takes on the following form, once the notion of axiom of
quantification is analyzed out.

*100. If φ is tautologous, ⊢ φ.

*101. ⊢ ⌜(α)(φ ⊃ ψ) ⊃. (α)φ ⊃ (α)ψ⌝.

*102. If α is not free in φ, ⊢ ⌜φ ⊃ (α)φ⌝.

*103. If φ′ is like φ except for containing free occurrences of α′
wherever φ contains free occurrences of α, then ⊢ ⌜(α)φ ⊃ φ′⌝.

*104. If ⌜φ ⊃ ψ⌝ and φ are theorems, so is ψ.

The principles *100-*104 are to be understood, of course, as
applying to all choices of φ, ψ, α, and α′. The three-digit numerals and
asterisks belong to a scheme of cross-reference which will become
clear in the sequel.²

Note that *104 cannot be abbreviated to read:

(4) If ⊢ ⌜φ ⊃ ψ⌝ and ⊢ φ then ⊢ ψ;

this means something more complicated than *104, namely that if
the closures of ⌜φ ⊃ ψ⌝ and φ are theorems then so is that of ψ. It
will be found later (*111, § 17) that (4) does in fact also hold; it
will be found that whenever ⌜φ ⊃ ψ⌝ and φ are formulae whose
closures are theorems according to *100-*104, the closure of ψ is
also a theorem according to *100-*104. But this result is no mere
translation of *104; its substantiation requires a considerable train
of reasoning.

Frege was perhaps the first to distinguish clearly between axioms and the rirles 
of inference whereby theorems are generated from the axioms. Once this distinction 
is drawn, a recursive characterization of the class of theorems is virtually at hand. 
But the highly explicit way of presenting formal deductive systems which is
customary nowadays dates back only to Hilbert (1922) or Post (1921). Under this plan
a class of expressions to be known as formulae is specified by reference to purely 

1 The sign ‘⊢’ was borrowed from Frege by Whitehead and Russell. The sense
which they attached to it is somewhat obscure, but comes near enough to this sense 
to justify my retention of the notation. 

2 A special sense will subsequently (§ 17) be attached to suppression of the 
asterisk; also an important distinction will be recorded by using daggers sometimes 
instead of asterisks (§ 25). Concerning the choice of numerals, see footnotes to 
§ 17.

notational features, ordinarily through a recursive characterization (cf. §§ 13,
23); then a subclass of formulas to be known as theorems is singled out by another 
recursive characterization, referring again to none but notational features. The 
so-called axioms, i.e. those initial theorems which are introduced by the first of the 
two clauses in a recursive characterization of theorem, used to be taken as finite in 
number and hence presented by list; the practice of describing an infinity of axioms 
dates only from von Neumann (1927). It is this innovation that has enabled us to 
keep to modus ponens as sole rule of inference, following Tarski (1935). The
combination of principles *100-*104 was anticipated in its main outlines by Fitch
in 1938. 

In the first edition an additional kind of axioms of quantification was recognized, 
beyond the present kinds A, B, C, and D; viz., the closures of all formulae of the 
form ⌜(α)(β)φ ⊃ (β)(α)φ⌝. Also the concept of closure differed slightly from the
present one, in that the quantifiers were arranged in the opposite order. But Berry
showed that by switching to the present version of closure we could dispense with 
those additional axioms and still get them as theorems. The proofs of *115 and *119 
below are due in principle to Berry. 

Most of the “metatheorems” running through the rest of the present chapter were
recorded by one or another of Frege, Peirce, and Schröder, and comprehensively
systematized by Whitehead and Russell (*9-*11).


§ 17. Metatheorems

STUDY of *100-*104 reveals an endless variety of general conditions
under which statements will be theorems. By recording
conditions of this kind once and for all, we avoid the labor of
writing out specific sequences of quantificational axioms and
ponentials to establish individual theorems falling under those conditions.
We establish theorems wholesale, by arguments which show that 
the appropriate sequences could be found for each particular case. 
Such principles, describing general circumstances under which 
statements are theorems, will be called metatheorems. *100-*104 
themselves are our initial metatheorems. 

So long as the atomic formulae remain unspecified, indeed, it is
impossible to cite any specific formulae (cf. § 13) and hence
impossible to cite any specific theorems. Apart from fictitious
examples in which makeshift atomic formulae are borrowed from
ordinary language (e.g. (3) of § 16), we can only cite the general
forms of theorems, using Greek letters as in *100-*104. At a
later stage (§§ 25 ff.), after the specification of atomic formula,
individual theorems will be forthcoming.

The following metatheorem is merely a special case of *103.

*110.¹ ⊢ ⌜(α)φ ⊃ φ⌝. Proof: *103 (taking α′ as α).

A less trivial metatheorem, anticipated in earlier discussion 
(§ 16; see also Appendix), is this: 

*111. If ⊢ ⌜φ ⊃ ψ⌝ and ⊢ φ then ⊢ ψ.

Proof. Let a number n be called favorable if *111 holds for all
conditionals ⌜φ ⊃ ψ⌝ having n or fewer free variables. What we
want to show is that every number is favorable. Suppose that k
is favorable, and consider any conditional ⌜φ ⊃ ψ⌝ which has just
k + 1 free variables and is such further that

          ⊢ ⌜φ ⊃ ψ⌝,                      (1)

          ⊢ φ.                            (2)

Where α is alphabetically the first of those free variables, the
closure of ⌜φ ⊃ ψ⌝ is also the closure of ⌜(α)(φ ⊃ ψ)⌝ (cf. § 14);
hence, by (1),

          ⊢ ⌜(α)(φ ⊃ ψ)⌝.                 (3)

Since the conditional ⌜(α)(φ ⊃ ψ) ⊃. (α)φ ⊃ (α)ψ⌝ has just k free
variables, and k is favorable, we see from *101 and (3) that

          ⊢ ⌜(α)φ ⊃ (α)ψ⌝.                (4)

Now if α is free in φ, it is alphabetically the first free variable of φ;
and in this case the closure of φ is also the closure of ⌜(α)φ⌝, so that

          ⊢ ⌜(α)φ⌝                        (5)

by (2). If on the other hand α is not free in φ, the conditional
⌜φ ⊃ (α)φ⌝ has at most k free variables; and in this case, since k is
favorable, (5) is forthcoming in view of *102 and (2). In either
case, thus, (5) holds. But the conditional ⌜(α)φ ⊃ (α)ψ⌝ has just
k free variables; by (4) and (5), then, since k is favorable,

1 The numeration of metatheorems will not be altogether consecutive.
Commonly, as in the present case, the numeral of the first metatheorem of a section
will open a fresh decade. Numerals having ‘0’ in the decade position are reserved
to the initial metatheorems *100-*104 of quantification and the initial
metatheorems *200-*202 of membership (§ 29).

          ⊢ ⌜(α)ψ⌝.                       (6)

Moreover, by *110,   ⊢ ⌜(α)ψ ⊃ ψ⌝.        (7)

Now if α is free in ψ, it is alphabetically the first free variable of ψ;
and in this case the closure of ψ is also the closure of ⌜(α)ψ⌝, so that

          ⊢ ψ                             (8)

by (6) alone. If on the other hand α is not free in ψ, ⌜(α)ψ ⊃ ψ⌝
has at most k free variables; and in this case, since k is favorable,
(8) is forthcoming in view of (7) and (6). We thus conclude that
k + 1 is a favorable number, having assumed that k was. But 0
is favorable, by *104; hence, taking k as 0 in the foregoing
argument, we conclude that 1 is favorable; hence, taking k as 1, we
conclude that 2 is favorable; and so on. *111 thus holds regardless
of the number of free variables.


In future proofs, *111 will be made the basis of a condensed
notation which may be illustrated thus:

          ⊢ ⌜(β)(φ ⊃ ψ)⌝.                 (1)

*101      ⊢ ⌜[1 ⊃.] (β)φ ⊃ (β)ψ⌝.         (2)

Suppose line (1) given or previously justified. Now the numeral ‘1’
in line (2) stands as an abbreviation for the whole expression
‘(β)(φ ⊃ ψ)’ which appears within corners in (1). Thus, if for the
moment we ignore the pair of square brackets, line (2) amounts to:

          ⊢ ⌜(β)(φ ⊃ ψ) ⊃. (β)φ ⊃ (β)ψ⌝,

and it is this line that *101 is cited to justify. Now from this line
and (1) we can conclude by *111 that

          ⊢ ⌜(β)φ ⊃ (β)ψ⌝,

and it is this step that the square brackets express. Any subsequent
citation of (2) refers to line (2) minus its bracketed part. 

The conventions involved may be summed up as follows.
Square brackets indicate deletion, on the basis of *111, of an
antecedent whose closure has been shown to be a theorem. A reference
at the left of a line justifies the line inclusive of any bracketed
matter. A parenthesized numeral at the right of a line L labels
L exclusive of any bracketed matter. Divorced of its parentheses,
this numeral serves in the sequel as an abbreviation of the whole
expression (exclusive of any bracketed matter) which appeared
within corners in L.

Sometimes, as in the line: 


          ⊢ ⌜[1 ⊃: 2 ⊃.] φ₁ ⊃ φ₃⌝,

which appears in the proof of the next metatheorem, square
brackets indicate repeated use of *111. This line is merely a
condensation of two lines:

          ⊢ ⌜[1 ⊃:] 2 ⊃. φ₁ ⊃ φ₃⌝,        (2′)

(2′)      ⊢ ⌜[2 ⊃.] φ₁ ⊃ φ₃⌝.

*112. If ⊢ ⌜φ₁ ⊃ φ₂⌝, ⊢ ⌜φ₂ ⊃ φ₃⌝, . . . , and ⊢ ⌜φₙ₋₁ ⊃ φₙ⌝
then ⊢ ⌜φ₁ ⊃ φₙ⌝.

Proof.
hp 1      ⊢ ⌜φ₁ ⊃ φ₂⌝.                    (1)

hp 2      ⊢ ⌜φ₂ ⊃ φ₃⌝.                    (2)

*100      ⊢ ⌜[1 ⊃: 2 ⊃.] φ₁ ⊃ φ₃⌝.        (3)

hp 3      ⊢ ⌜φ₃ ⊃ φ₄⌝.                    (4)

*100      ⊢ ⌜[3 ⊃: 4 ⊃.] φ₁ ⊃ φ₄⌝; and so on.


The notation ‘hp 1 ’ refers to the first of the hypotheses in the 
metatheorem which is being proved; ‘hp 2 ’ refers to the second 
hypothesis, and so on. 

The citation of *100 calls upon the reader to observe that the 
formula depicted is tautologous, and hence that its closure is a 
theorem. The reader can always verify the claim of tautology, by 
truth table if not by inspection. In order that brief inspection 
may ordinarily suffice, the use of *100 will be limited to fairly 
simple forms of tautology: forms which, like (l)-(48) of § 11, are 
capable of depiction by means of not more than three distinct 
Greek letters and not more than seven Greek-letter occurrences. 
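
The truth-table check that a citation of *100 asks of the reader can be sketched mechanically. The following Python fragment is only an illustration, not part of the system: the helper names `implies`, `is_tautology`, `chain`, and `converse` are assumptions introduced here. It tests the form used at line (3) of the proof of *112, which has three distinct letters and six letter occurrences, and contrasts it with a form that is not tautologous.

```python
from itertools import product

def implies(p, q):
    """Material conditional: false only when p is true and q is false."""
    return (not p) or q

def is_tautology(form, letters):
    """True if form(...) comes out true under every assignment of
    truth values to its `letters` arguments (a full truth table)."""
    return all(form(*values) for values in product([True, False], repeat=letters))

def chain(p1, p2, p3):
    """(p1 > p2) > ((p2 > p3) > (p1 > p3)), the form behind *112."""
    return implies(implies(p1, p2), implies(implies(p2, p3), implies(p1, p3)))

def converse(p1, p2):
    """(p1 > p2) > (p2 > p1), which fails when p1 is false and p2 true."""
    return implies(implies(p1, p2), implies(p2, p1))

print(is_tautology(chain, 3))     # True
print(is_tautology(converse, 2))  # False
```

A table for n letters has 2ⁿ rows, which is one practical reason for keeping cited tautologies to three distinct letters.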

Use of *112 will ordinarily be tacit, through the medium of a
notation of stacked conditionals. If metatheorems *m₁, *m₂, . . .,
*mₙ₋₁ show respectively that ⊢ ⌜φ₁ ⊃ φ₂⌝, ⊢ ⌜φ₂ ⊃ φ₃⌝, . . . , and
⊢ ⌜φₙ₋₁ ⊃ φₙ⌝, and we want then to draw the conclusion (k) that
⊢ ⌜φ₁ ⊃ φₙ⌝, we depict the whole argument as follows:

*m₁       ⊢ ⌜φ₁ ⊃ φ₂

*m₂               ⊃ φ₃
           . . .
*mₙ₋₁             ⊃ φₙ⌝.                  (k)

In the fourth line of the proof of *117 below, thus, the entry ‘(2)’ at
the left is intended to show that ⊢ ⌜(αₙ)φ ⊃ (αₙ)ψ⌝, whereas what
is labelled by the ‘(3)’ at the right of that line is rather
‘⊢ ⌜φ ⊃ (αₙ)ψ⌝’. Similarly, in the third line of the proof of *116 below, the
entry ‘*101’ is intended to show that

          ⊢ ⌜(α)(φ ⊃ ψ) ⊃. (α)φ ⊃ (α)ψ⌝,

whereas what is labelled by the ‘(2)’ at the right of that line is
rather:

          ⊢ ⌜(α)(φ ≡ ψ) ⊃. (α)φ ⊃ (α)ψ⌝

(bracketed matter being excluded as usual).

A similar procedure of stacking biconditionals is justified by the 
following metatheorem. 

*113. If ⊢ ⌜φ₁ ≡ φ₂⌝, ⊢ ⌜φ₂ ≡ φ₃⌝, . . . , and ⊢ ⌜φₙ₋₁ ≡ φₙ⌝ then ⊢ ⌜φ₁ ≡ φₙ⌝.
Proof similar to that of *112. 

*114. ⊢ ⌜(α₁)(α₂). . .(αₙ)φ ⊃ φ⌝.

Proof. Case 1: n = 0. *100.

Case 2: n > 0.
*110      ⊢ ⌜(α₁). . .(αₙ)φ ⊃ (α₂). . .(αₙ)φ

*110                        ⊃ (α₃). . .(αₙ)φ
           . . .
*110                        ⊃ φ⌝.

*115. If ⊢ φ then ⊢ ⌜(α)φ⌝.

Proof. Case 1: α is not free in φ. By *102, hypothesis, and
*111, ⊢ ⌜(α)φ⌝.

Case 2: α is free in φ. Let β₁, . . . , βₘ, α, γ₁, . . . , γₙ be the free
variables of φ in alphabetic order, and let δ₁, . . . , δₘ be a segment
of the alphabet beyond all variables of φ. For each i from 1 to m let
us form φᵢ from φ by changing β₁, . . . , βᵢ to δ₁, . . . , δᵢ. Now the
closure of φ, by hypothesis a theorem, is simultaneously the
closure of ⌜(β₁)φ⌝, so

          ⊢ ⌜(β₁)φ⌝.                      (1)

*103      ⊢ ⌜[1 ⊃] φ₁⌝.

But the closure of φ₁ is simultaneously that of ⌜(β₂)φ₁⌝, so

          ⊢ ⌜(β₂)φ₁⌝.                     (2)

*103      ⊢ ⌜[2 ⊃] φ₂⌝.

Continuing thus, we conclude that ⊢ φₘ. But the closure of φₘ is
⌜(δₘ). . .(δ₁)(γₙ). . .(γ₁)(α)φₘ⌝; so

          ⊢ ⌜(δₘ). . .(δ₁)(γₙ). . .(γ₁)(α)φₘ⌝.              (3)

*103      ⊢ ⌜[3 ⊃] (δₘ₋₁). . .(δ₁)(γₙ). . .(γ₁)(α)φₘ₋₁⌝.     (4)

*103      ⊢ ⌜[4 ⊃] (δₘ₋₂). . .(δ₁)(γₙ). . .(γ₁)(α)φₘ₋₂⌝.

Continuing thus, we conclude that

          ⊢ ⌜(γₙ). . .(γ₁)(α)φ⌝.                             (5)

*114      ⊢ ⌜[5 ⊃] (α)φ⌝.

*116. ⊢ ⌜(α)(φ ≡ ψ) ⊃. (α)φ ≡ (α)ψ⌝.

Proof.

*100      ⊢ ⌜φ ≡ ψ .⊃. φ ⊃ ψ⌝.                               (1)

*101      ⊢ ⌜[(α)1 ⊃.] (α)(φ ≡ ψ) ⊃ (α)(φ ⊃ ψ)

*101                              ⊃. (α)φ ⊃ (α)ψ⌝.           (2)

Similarly ⊢ ⌜(α)(φ ≡ ψ) ⊃. (α)ψ ⊃ (α)φ⌝.                     (3)

*100      ⊢ ⌜[2 ⊃:. 3 ⊃:] (α)(φ ≡ ψ) ⊃. (α)φ ⊃ (α)ψ . (α)ψ ⊃ (α)φ⌝,

q.e.d. (cf. D5).

Attachment of a quantifier to a numeral within square brackets 
indicates that the application of *111 connoted by the brackets is 
to be preceded by use of *115. Thus the second line of the above 
proof embodies the following steps: 

(1), *115 ⊢ ⌜(α)1⌝.                                          (1′)

*101      ⊢ ⌜[1′ ⊃.] (α)(φ ≡ ψ) ⊃ (α)(φ ⊃ ψ)⌝.

An expression following a citation of *100 may exhibit Greek 
letters in excess of the prescribed limit of three distinct letters and 
seven occurrences, so long as the whole exemplifies some general 
tautologous form which falls within the limit. At the end of the 
above proof, *100 is cited to introduce an expression which (when 
the abbreviations ‘2’ and ‘3’ are supplanted) contains fourteen 
occurrences of Greek letters; the whole is an instance, however, of
the simple tautologous form

          ⌜χ₁ ⊃ χ₂ .⊃:. χ₁ ⊃ χ₃ .⊃: χ₁ ⊃. χ₂ . χ₃⌝

(a variant of (42), § 11).

*117. If ⊢ ⌜φ ⊃ ψ⌝, and none of α₁, . . . , αₙ is free in φ, then
⊢ ⌜φ ⊃ (α₁). . .(αₙ)ψ⌝.

Proof.
hp 1            ⊢ ⌜φ ⊃ ψ⌝.                        (1)

*101            ⊢ ⌜[(αₙ)1 ⊃.] (αₙ)φ ⊃ (αₙ)ψ⌝.     (2)

*102 (& hp 2)   ⊢ ⌜φ ⊃ (αₙ)φ

(2)                   ⊃ (αₙ)ψ⌝.                   (3)

The argument leading from (1) to (3) leads in turn from (3) to:

                ⊢ ⌜φ ⊃ (αₙ₋₁)(αₙ)ψ⌝,

thence to: ⊢ ⌜φ ⊃ (αₙ₋₂)(αₙ₋₁)(αₙ)ψ⌝, and so on.

Just as ‘1’, ‘2’, etc. are used as abbreviations of the expressions
appearing within corners in the lines (1), (2), etc. of a given proof,
so ‘102’, ‘110’, etc. will be used as abbreviations of the expressions
appearing within corners in the metatheorems *102, *110, etc.¹
Thus ‘102’, ‘110’, and ‘118’, used in the next proof, are short for
‘φ ⊃ (α)φ’, ‘(α)φ ⊃ φ’, and ‘φ ≡ (α)φ’.

*118. If α is not free in φ, ⊢ ⌜φ ≡ (α)φ⌝.
Proof. *100 ⊢ ⌜[102 ⊃. 110 ⊃] 118⌝.

*119. ⊢ ⌜(α)(β)φ ≡ (β)(α)φ⌝.

Proof.
*114      ⊢ ⌜(α)(β)φ ⊃ φ⌝.                (1)

(1), *117 ⊢ ⌜(α)(β)φ ⊃ (β)(α)φ⌝.          (2)

Similarly ⊢ ⌜(β)(α)φ ⊃ (α)(β)φ⌝.          (3)

*100      ⊢ ⌜[2 ⊃. 3 ⊃] 119⌝.



1 This convention was suggested by my student Miss Leigh D. Steinhardt. 
It would not work if the numeration of metatheorems had begun with ‘ *1 ’ rather 
than ‘*100’, since *1, *2, etc. would cease to be distinguished from (1), (2), etc.

§ 18. Substitutivity of the Biconditional

IN § 13 a restriction was imposed according to which one formula
can occur in another only in a context of quantification or
truth-functional composition. This restriction gives rise, it will be
found, to the following convenient substitutivity principle: if φ
and φ′ are statements agreeing in truth value, then φ′ can be
substituted for any occurrences of φ in any statement ψ without
affecting the truth value of ψ. In other words, if the statements
ψ and ψ′ are alike except that ψ′ contains the statement φ′ in
places where ψ contains the statement φ, then ψ and ψ′ are alike
in truth value if φ and φ′ are. In other words, (I) any statement
of the form ⌜φ ≡ φ′ .⊃. ψ ≡ ψ′⌝ is true, where ψ′ is like ψ except
for containing φ′ in places where ψ contains φ. E.g., the conditional:

(1)   Smith met Jones .≡. Jones was in Omaha :⊃.

      (x)(Smith met Jones .∨ ~ (Smith sold x to Jones)) ≡
      (x)(Jones was in Omaha .∨ ~ (Smith sold x to Jones))

is true. 

Moreover, any such conditional is true independently of the 
particular names which happen to occur in it; replacement of 
names by free variables leaves a matrix which is (roughly speaking) 
true for all values of the variables —- a matrix whose closure is 
true. E.g., the matrix: 

(2)   y met z .≡. z was in w :⊃.

      (x)(y met z .∨ ~ (y sold x to z)) ≡

      (x)(z was in w .∨ ~ (y sold x to z))

has a true closure:

(3)   (z)(y)(w)(y met z .≡. z was in w :⊃.

      (x)(y met z .∨ ~ (y sold x to z)) ≡

      (x)(z was in w .∨ ~ (y sold x to z))).

This does not, however, entitle us to generalize (I) in the following
fashion: (II) the closure of ⌜φ ≡ φ′ .⊃. ψ ≡ ψ′⌝ is true where
ψ′ is like ψ except for containing φ′ in places where ψ contains φ.
For, many conditional matrices of the kind described in (II) do
not correspond in the intended fashion to the conditional
statements described in (I). One is this:

(4)   x = x .≡. x < y :⊃. (x)(x = x) ≡ (x)(x < y).

When names are put for the free variables of (2), we have a
conditional statement, e.g. (1), of the kind described in (I); but
when names are put for the free variables of (4), we get conditional
statements such as the following, altogether alien to (I):

(5)   0 = 0 .≡. 0 < 1 :⊃. (x)(x = x) ≡ (x)(x < 1).

Far from being a conditional of the kind described in (I), (5)
is not even true, as is readily seen by observing the truth of
‘0 = 0 .≡. 0 < 1’ and ‘(x)(x = x)’ and the falsehood of
‘(x)(x < 1)’ and then calculating the truth value of the whole
by truth-tables. (II) is too strong; the matrix (4) is not true
for all values of its variables, i.e. does not have a true closure, as 
the counter-instance (5) demonstrates. 
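
The calculation behind (5) can be replayed as a small sketch. This is only an illustration under stated assumptions: the universe is cut down to the three numbers 0, 1, 2 (`DOMAIN` is a hypothetical stand-in, not anything in the text), and `matrix4` is a name introduced here for the matrix (4). The point it shows is that the ‘x’ of the consequent is rebound by ‘(x)’, so the value chosen for the free ‘x’ never reaches it.

```python
def iff(p, q):
    """Biconditional: true when both sides agree in truth value."""
    return p == q

def implies(p, q):
    """Material conditional."""
    return (not p) or q

DOMAIN = range(3)  # assumed finite stand-in for the numbers: 0, 1, 2

def matrix4(x, y):
    """Matrix (4): x = x .≡. x < y :⊃. (x)(x = x) ≡ (x)(x < y)."""
    antecedent = iff(x == x, x < y)
    # The quantifier '(x)' rebinds 'x' here, hence the fresh variable v.
    consequent = iff(all(v == v for v in DOMAIN), all(v < y for v in DOMAIN))
    return implies(antecedent, consequent)

print(matrix4(0, 1))                                       # False, as in (5)
print(all(matrix4(x, y) for x in DOMAIN for y in DOMAIN))  # False
```

The second line is the finite analogue of saying that (4) does not have a true closure.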

The essential difference between ( 2 ) and (4) is apparent. 
Roughly speaking, the trouble with (4) is that when the ‘x = x’
and ‘x < y’ of the antecedent recur in the consequent they fall
under new quantifiers which capture free variables of the
antecedent and divert them to their own purposes. The occurrences
of ‘x = x’ and ‘x < y’ in the consequent of (4) resemble those
in the antecedent by virtue only of an alphabetic accident; for
the occurrences of ‘x’ in ‘(x)(x = x)’ and ‘(x)(x < y)’ are
governed by the adjacent occurrences of the quantifier ‘(x)’, and
have nothing but an alphabetic coincidence in common with the
occurrences of ‘x’ in ‘x = x .≡. x < y’. (4) could be rendered
equivalently as:

(6)   x = x .≡. x < y :⊃. (z)(z = z) ≡ (w)(w < y),

to which (II) is obviously irrelevant.

The corrected form of (II), which avoids such unintended cases
as (4), is this: (III) the closure of ⌜φ ≡ φ′ .⊃. ψ ≡ ψ′⌝ is true where
ψ′ is like ψ except for containing free occurrences of φ′ at some places
where ψ contains free occurrences of φ. (I) is that special case of
(III) which arises when ⌜φ ≡ φ′ .⊃. ψ ≡ ψ′⌝ is its own closure, i.e.,
a statement.

Nothing has been said thus far by way of establishing even (I),
much less (III); all that has been shown is that (II) fails. But in
the metatheorem *122 below it will be shown that the broad
substitutivity principle (III) holds, and more: the closures of the
conditionals described in (III) are not merely true, but theorems.

It turns out, moreover, that even such recalcitrant forms as (4) 
will hold if we modify them to the extent of attaching one or more 
appropriate quantifiers to the antecedent. In particular, whereas 
the closure of (4) as it stands is false, the closure of: 

      (x)(x = x .≡. x < y) ⊃. (x)(x = x) ≡ (x)(x < y)

is true. A generalization of (III) or *122 which covers such cases 
appears below as * 121 . This is proved ahead of *122, which then 
follows as a special case. 
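
The effect of quantifying the antecedent can be checked by brute force in a small universe. The sketch below is an assumption-laden illustration rather than a proof: it replaces ‘x = x’ and ‘x < y’ by arbitrary predicates F and G over a three-object universe (the names `DOMAIN`, `predicates`, `quantified`, and `unquantified` are introduced here), and it establishes only truth in that finite universe, not theoremhood.

```python
from itertools import product

DOMAIN = range(3)  # assumed three-object universe

def implies(p, q):
    """Material conditional."""
    return (not p) or q

# Every predicate on DOMAIN, coded as a tuple of truth values.
predicates = list(product([True, False], repeat=len(DOMAIN)))

def quantified(F, G):
    """(x)(Fx ≡ Gx) ⊃. (x)Fx ≡ (x)Gx, with the antecedent quantified."""
    return implies(all(F[x] == G[x] for x in DOMAIN), all(F) == all(G))

def unquantified(F, G, x):
    """Fx ≡ Gx .⊃. (x)Fx ≡ (x)Gx, with 'x' free in the antecedent, as in (4)."""
    return implies(F[x] == G[x], all(F) == all(G))

print(all(quantified(F, G) for F in predicates for G in predicates))  # True
print(all(unquantified(F, G, x)
          for F in predicates for G in predicates for x in DOMAIN))   # False
```

The failing case for the unquantified form is of the same kind as (5): F true of everything, G true of the chosen object but not of all.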

Preparatory to proving *121 as a whole, it is convenient to prove
the case where ψ′ differs from ψ in point of just one occurrence of φ′.
This case takes the form of the following metatheorem.

*120. If ψ′ is like ψ except for containing φ′ at a place where ψ
contains φ, and α₁, . . . , αₙ (n ≥ 0) exhaust the variables with respect
to which these occurrences of φ and φ′ are bound in ψ and ψ′, then

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ ≡ ψ′⌝.

Proof. Let ψ₀, ψ₁, . . . , ψₖ (k ≥ 0) be, in order of increasing
length, all the formulae which occur in ψ and contain the occurrence
of φ in question; and let ψ′₀, ψ′₁, . . . , ψ′ₖ be the corresponding
parts of ψ′. From the general method of constructing formulae
(§ 13) it is then apparent, for each i from 1 to k, that ψᵢ and ψ′ᵢ are
either ⌜ψᵢ₋₁ ↓ χ⌝ and ⌜ψ′ᵢ₋₁ ↓ χ⌝ for some formula χ (Case 1) or else
⌜χ ↓ ψᵢ₋₁⌝ and ⌜χ ↓ ψ′ᵢ₋₁⌝ (Case 2) or else ⌜(β)ψᵢ₋₁⌝ and ⌜(β)ψ′ᵢ₋₁⌝ for
some variable β (Case 3). Now if it happens that

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψᵢ₋₁ ≡ ψ′ᵢ₋₁⌝          (1)

it will follow that

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψᵢ ≡ ψ′ᵢ⌝,              (2)

as will now be shown in each of the three cases.

Cases 1, 2:
(1)       ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψᵢ₋₁ ≡ ψ′ᵢ₋₁

*100                               ⊃. ψᵢ ≡ ψ′ᵢ⌝.

Case 3. Any occurrence of β, within the given occurrence of φ in
ψᵢ, will be bound to the occurrence of ⌜(β)⌝ which begins ⌜(β)ψᵢ₋₁⌝ if
not to a later occurrence of ⌜(β)⌝. Thus any occurrence of β
which is free in φ will be bound to the occurrence of ⌜(β)⌝ which
begins ⌜(β)ψᵢ₋₁⌝ or to some later occurrence of ⌜(β)⌝ which is
likewise outside φ. Hence if β is free in φ then the occurrence of φ is
bound in ψ with respect to β. Then, by hp 2, β is among α₁, . . .,
αₙ. By similar reasoning, if β is free in φ′ then β is among α₁, . . .,
αₙ. Thus, regardless of whether β is free in ⌜φ ≡ φ′⌝, β will not be
free in ⌜(α₁). . .(αₙ)(φ ≡ φ′)⌝. Hence

(1), *117 ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃ (β)(ψᵢ₋₁ ≡ ψ′ᵢ₋₁)

*116                               ⊃. ψᵢ ≡ ψ′ᵢ⌝.

In Cases 1-3 alike, therefore, and hence for each i from 1 to k, (2)
holds if (1) does. But, since ψ₀ is φ and ψ′₀ is φ′,

*114      ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ₀ ≡ ψ′₀⌝.

Hence, taking i as 1, it follows that

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ₁ ≡ ψ′₁⌝.

Hence, taking i as 2, it follows that

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ₂ ≡ ψ′₂⌝.

By k such steps we conclude that

          ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψₖ ≡ ψ′ₖ⌝.

But ψₖ is ψ and ψ′ₖ is ψ′.

The more general metatheorem *121 is now easily established. 

*121. If ψ′ is like ψ except for containing φ′ at some places where ψ
contains φ, and α₁, . . . , αₙ (n ≥ 0) exhaust the variables with respect
to which these occurrences of φ and φ′ are bound in ψ and ψ′, then
⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ ≡ ψ′⌝.

Proof. Let those places be k in number; and, for each i from 0
to k, let ψᵢ be like ψ except for containing φ′ instead of φ at the first
i of the k places. Then, for each i from 1 to k, ψᵢ is like ψᵢ₋₁ except
for containing an occurrence of φ′ where ψᵢ₋₁ contains an occurrence
of φ; wherefore, in view of hp 2,

*120      ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψᵢ₋₁ ≡ ψᵢ⌝.                       (1)

*100      ⊢ ⌜[1 ⊃:] (α₁). . .(αₙ)(φ ≡ φ′) . ψᵢ₋₁ .≡. (α₁). . .(αₙ)(φ ≡ φ′) . ψᵢ⌝.

Thus      ⊢ ⌜(α₁). . .(αₙ)(φ ≡ φ′) . ψ₀ .≡. (α₁). . .(αₙ)(φ ≡ φ′) . ψ₁

                                      .≡. (α₁). . .(αₙ)(φ ≡ φ′) . ψ₂
                                       . . .
                                      .≡. (α₁). . .(αₙ)(φ ≡ φ′) . ψₖ⌝.   (2)

*100      ⊢ ⌜[2 ⊃:] (α₁). . .(αₙ)(φ ≡ φ′) ⊃. ψ₀ ≡ ψₖ⌝.

But ψ₀ is ψ and ψₖ is ψ′.

*120 differs from *121 in requiring that just one occurrence of φ
in ψ give way to φ′ in ψ′. This means, of course, just one
occurrence from the point of view of unabbreviated notation. Even
where only a single occurrence of φ appears in our definitionally
abbreviated notations, expansion according to the definitions will 
invariably show that many occurrences are really present. In 
practice, therefore, *120 is not useful; the more general principle 
*121 is needed. 

The principle *122 discussed earlier follows from *121 as a 
special case. 

*122. If ψ′ is like ψ except for containing free occurrences of φ′ at
some places where ψ contains free occurrences of φ, then

          ⊢ ⌜φ ≡ φ′ .⊃. ψ ≡ ψ′⌝.

Proof: *121 (taking n as 0).

*121 also has the following much used corollary, in which mention
of freedom or bondage ceases to be needed: If ⊢ ⌜φ ≡ φ′⌝ then φ
and φ′ are interchangeable anywhere. This takes the form of the
following metatheorem.¹

*123. If ⊢ ⌜φ ≡ φ′⌝, and ψ′ is formed from ψ by putting φ′ for some
occurrences of φ, then ⊢ ⌜ψ ≡ ψ′⌝.

Proof. Let α₁, . . . , αₙ be all the variables of φ and φ′.

hp 1            ⊢ ⌜φ ≡ φ′⌝.                          (1)

*121 (& hp 2)   ⊢ ⌜[(α₁). . .(αₙ)1 ⊃.] ψ ≡ ψ′⌝.

The following is a corollary in turn of *123. 

1 A proof covering this metatheorem is given by Hilbert and Ackermann (Ch. 3, § 7);
but they do not mention the stronger principles *121-*122. 



*124. If ⊢ φ and ⊢ χ, and χ′ is formed from χ by putting ψ for some
occurrences of ⌜φ ⊃ ψ⌝ or ⌜φ ≡ ψ⌝ or ⌜ψ ≡ φ⌝ or ⌜φ . ψ⌝ or
⌜ψ . φ⌝, then ⊢ χ′.

Proof of the case of ⌜φ ⊃ ψ⌝.

hp 1      ⊢ φ.                            (1)

hp 2      ⊢ χ.                            (2)

*100      ⊢ ⌜[1 ⊃:] φ ⊃ ψ .≡ ψ⌝.          (3)

(3), *123 ⊢ ⌜χ ≡ χ′⌝.                     (4)

*100      ⊢ ⌜[4 ⊃. 2 ⊃] χ′⌝.

The other cases are proved similarly. 

*124 is made the basis of a powerful extension of the device of
square brackets. Where ⊢ φ, *124 provides for the immediate
deletion of ⌜φ ⊃⌝ or ⌜φ ≡⌝ or ⌜≡ φ⌝ or ⌜φ .⌝ or ⌜. φ⌝ from
any position within any theorem. Square brackets, hitherto used
only in the limited fashion justified by *111, can now be used to
indicate deletions of all these kinds.¹


§ 19. Existential Quantification 

TO SAY that nothing fulfills a given condition is to say that
everything fulfills the denial of that condition. To affirm e.g. that
there are no objects x such that x is distinct from itself, we may
write:

(x) ~ (x is distinct from x);

i.e., ‘No matter what x may be, ~ (x is distinct from x).’ To
affirm that there are no black swans, we may write:

(1)   (x) ~ (x is black . x is a swan).

In effect, thus, the composite prefix ‘(x) ~’ amounts to the words
‘there are no objects x such that’.

To say that there are objects satisfying a given condition, we 
have only to deny that there are none. To say that there are black 
swans we have only to apply a denial sign to ( 1 ), obtaining: 

1 See the remarks on the proof of *136, § 19.

(2)   ~ (x) ~ (x is black . x is a swan).

Just as ‘(x) ~’ amounts to ‘there are no objects x such that’, so
‘~ (x) ~’ amounts to ‘there is at least one object x such that’,
‘there is something x such that’. The parts of ‘~ (x) ~’ do not,
of course, hang together as a unit; in (2) the second ‘~’ applies
to:

(x is black . x is a swan),

the quantifier ‘(x)’ applies to:

~ (x is black . x is a swan),

and the first ‘~’ applies to the whole quantification (1). But the
configuration of prefixes ‘~ (x) ~’ figures so prominently in
subsequent developments that it is convenient to adopt a condensed
notation for it; the customary one is ‘(∃x)’, which we may read
‘there is something x such that’. Hence the following abbreviative
convention:

D8. ⌜(∃α)⌝ for ⌜~ (α) ~⌝.

Prefixes of the form ⌜(∃α)⌝ are known as existential (or particular)
quantifiers; and, where distinction is necessary, the basic
quantifiers ⌜(α)⌝ are referred to as universal quantifiers.¹
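
Read semantically over a finite universe, D8 can be sketched as follows. The fragment is illustrative only: the list `birds`, the condition `black_swan`, and the helper `exists` are hypothetical names introduced here. It defines existence exactly as ‘~ (x) ~’ and compares the result with Python's built-in `any`.

```python
def exists(domain, cond):
    """D8 read semantically: (∃x)φ is ~ (x) ~ φ."""
    return not all(not cond(x) for x in domain)

# Hypothetical three-object universe for the black-swan example.
birds = [
    {"kind": "swan", "color": "white"},
    {"kind": "swan", "color": "black"},
    {"kind": "crow", "color": "black"},
]

def black_swan(b):
    """x is black . x is a swan"""
    return b["color"] == "black" and b["kind"] == "swan"

print(exists(birds, black_swan))                                       # True
print(exists(birds, black_swan) == any(black_swan(b) for b in birds))  # True
```

Dropping the outer `not` gives the reading of (1), that there are no black swans, which is false in this universe.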

The first four of the ensuing metatheorems relate existential 
quantification with universal quantification in obvious fashion. 

*130. ⊢ ⌜~ (α)φ ≡ (∃α) ~ φ⌝.

Proof. *100      ⊢ ⌜φ ≡ ~ ~ φ⌝.                       (1)

       (1), *123 ⊢ ⌜~ (α)φ ≡ ~ (α) ~ ~ φ⌝, q.e.d. (cf. D8).

Arguments of the above sort, wherein a tautology is adduced by
*100 and then *123 is applied to it, will hereafter be rendered in a
single line with the composite justification ‘*100, *123’; thus the
above proof becomes:

*100, *123       ⊢ ⌜~ (α)φ ≡ ~ (α) ~ ~ φ⌝.

From such a condensed formulation we can always discern the two
components of the relevant tautologous biconditional (in this case
φ and ⌜~ ~ φ⌝), simply by comparing the two sides of the
exhibited biconditional (in this case ⌜~ (α)φ⌝ and ⌜~ (α) ~ ~ φ⌝)
and picking out their dissimilar ingredients.

1 The version ⌜~ (α) ~⌝ of ⌜(∃α)⌝ goes back to Frege. The notation ⌜(∃α)⌝
was adapted from Peano by Whitehead and Russell.

The formulae described in metatheorems and proofs are always 
to be thought of, theoretically, as written out in full without 
definitional abbreviations (cf. § 13). In practice, for the most 
part, we do not need to reflect on the details of the expanded form; 
but occasionally we do need to lift one or another abbreviation in 
order to recognize a particular step of proof. On such occasions 
the relevant definition is mentioned parenthetically, as in the 
above proof and the following one. Expansion of the abbreviation 
⌜(∃α)⌝ into ⌜~ (α) ~⌝, e.g., immediately reveals that the following
metatheorem is covered by *100.

*131. ⊢ ⌜~ (∃α)φ ≡ (α) ~ φ⌝. Proof: *100 (& D8).

*132. ⊢ ⌜~ (α₁). . .(αₙ)φ ≡ (∃α₁). . .(∃αₙ) ~ φ⌝.

Proof. *130       ⊢ ⌜~ (α₁). . .(αₙ)φ ≡ (∃α₁) ~ (α₂). . .(αₙ)φ

       *130, *123                      ≡ (∃α₁)(∃α₂) ~ (α₃). . .(αₙ)φ
                                        . . .
       *130, *123                      ≡ (∃α₁). . .(∃αₙ) ~ φ⌝.

*133. ⊢ ⌜~ (∃α₁). . .(∃αₙ)φ ≡ (α₁). . .(αₙ) ~ φ⌝.

Proof similar, using *131.

The next two metatheorems are the analogues, for existential 
quantification, of *103 and *110. 

*134. If φ′ is like φ except for containing free occurrences of α′
wherever φ contains free occurrences of α, then ⊢ ⌜φ′ ⊃ (∃α)φ⌝.
Proof. *103 (& hp)   ⊢ ⌜(α) ~ φ ⊃ ~ φ′⌝.              (1)

       *100 (& D8)   ⊢ ⌜[1 ⊃] 134⌝.

*135. ⊢ ⌜φ ⊃ (∃α)φ⌝. Proof: *134 (taking α′ as α).

A corollary follows.

*136. ⊢ ⌜(α)φ ⊃ (∃α)φ⌝.

Proof. *100 ⊢ ⌜[110 . 135 .⊃] 136⌝.

The bracketing involved in the above proof, which gets rid of two
numerals ‘110’ and ‘135’ at once, merely indicates repeated use of
the simple bracketing device. It can be broken down thus:

⊢ ⌜110 . 135 .⊃ 136⌝,    ⊢ ⌜[110 .] 135 .⊃ 136⌝,

⊢ ⌜135 ⊃ 136⌝,           ⊢ ⌜[135 ⊃] 136⌝,

or alternatively thus:

⊢ ⌜110 . 135 .⊃ 136⌝,    ⊢ ⌜110 [. 135] .⊃ 136⌝,

⊢ ⌜110 ⊃ 136⌝,           ⊢ ⌜[110 ⊃] 136⌝.

*124 is the justification of this bracketing out of ‘110 .’ or ‘. 135’;
and it justifies similar bracketing when ‘≡’ occurs in place of the
‘.’ of these examples. The bracketing of ‘135 ⊃’ and ‘110 ⊃’,
above, is justified by *124 and *111 indifferently. Even in cases
of this sort ‘[n ⊃]’, however, *124 allows more freedom than *111
allowed; matter bracketed out according to *124 may stand in
the interior of a line of proof, whereas *111 served only for
bracketing out the main antecedent of the whole line.

The following metatheorem explains vacuous existential
quantification. It is the analogue of *118.

*137. If α is not free in φ, ⊢ ⌜φ ≡ (∃α)φ⌝.

Proof. *118 (& hp)   ⊢ ⌜~ φ ≡ (α) ~ φ⌝.               (1)

       *100 (& D8)   ⊢ ⌜[1 ⊃] 137⌝.

The next shows that consecutive existential quantifiers, like 
consecutive universal ones, are permutable. It is the analogue of 
*119. 

*138. ⊢ ⌜(∃α)(∃β)φ ≡ (∃β)(∃α)φ⌝.

Proof. *100, *123 (& D8)   ⊢ ⌜(∃α)(∃β)φ ≡ ~ (α)(β) ~ φ

       *119, *123                       ≡ ~ (β)(α) ~ φ

       *100, *123 (& D8)                ≡ (∃β)(∃α)φ⌝.

The order in which consecutive quantifiers are written can make 
a difference, on the other hand, when one quantifier is existential 
and the other universal. ‘(∃x)(y)(x = y)’ is false, e.g., whereas
‘(y)(∃x)(x = y)’ is true. It does turn out that ⌜(∃α)(β)φ⌝
implies ⌜(β)(∃α)φ⌝, but the converse fails.
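
Both halves of this remark can be checked exhaustively in a two-object universe. The sketch below makes its assumptions explicit: `DOMAIN`, `relations`, `exists_all`, and `all_exists` are names introduced here, φ is replaced by an arbitrary two-place relation R, and success on a finite universe shows truth there, not theoremhood.

```python
from itertools import product

DOMAIN = range(2)  # assumed two-object universe
pairs = [(x, y) for x in DOMAIN for y in DOMAIN]

# Every two-place relation on DOMAIN, coded as a dict from pairs to truth values.
relations = [dict(zip(pairs, values))
             for values in product([True, False], repeat=len(pairs))]

def exists_all(R):
    """(∃x)(y)Rxy"""
    return any(all(R[(x, y)] for y in DOMAIN) for x in DOMAIN)

def all_exists(R):
    """(y)(∃x)Rxy"""
    return all(any(R[(x, y)] for x in DOMAIN) for y in DOMAIN)

identity = {(x, y): x == y for (x, y) in pairs}
print(exists_all(identity), all_exists(identity))                      # False True

forward = all((not exists_all(R)) or all_exists(R) for R in relations)
converse = all((not all_exists(R)) or exists_all(R) for R in relations)
print(forward, converse)                                               # True False
```

The identity relation is itself the counterexample to the converse, matching the example with ‘x = y’.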

*139. ⊢ ⌜(∃α)(β)φ ⊃ (β)(∃α)φ⌝.
Proof. *101        ⊢ ⌜[(β)135 ⊃.] (β)φ ⊃ (β)(∃α)φ⌝.            (1)

       *100        ⊢ ⌜[1 ⊃.] ~ (β)(∃α)φ ⊃ ~ (β)φ⌝.             (2)

       (2), *117   ⊢ ⌜~ (β)(∃α)φ ⊃ (α) ~ (β)φ⌝.                (3)

       *100 (& D8) ⊢ ⌜[3 ⊃] 139⌝.

Now that the metatheorems have come to be scattered through a 
considerable number of pages, and an ever increasing number, the 
reader is advised to use the list at the end of the book in order to 
identify metatheorems cited in proofs. If a metatheorem is used 
only within the section in which it appears, it is not entered in the 
list; but in this case the reader needs only to glance back a page or 
two. The definitions are likewise assembled at the end of the book. 


§ 20 . Distribution of Quantifiers 

WE TURN now to some principles dealing with the distribution of 
a quantifier through a binary compound. 


*140. ⊢ ⌜(α)(φ . ψ) ≡. (α)φ . (α)ψ⌝.

Proof. *100      ⊢ ⌜φ . ψ .⊃ φ⌝.                              (1)

       *101      ⊢ ⌜[(α)1 ⊃.] (α)(φ . ψ) ⊃ (α)φ⌝.             (2)

       Similarly ⊢ ⌜(α)(φ . ψ) ⊃ (α)ψ⌝.                       (3)

       *100      ⊢ ⌜(α)φ . (α)ψ .⊃ (α)φ

       *110                      ⊃ φ⌝.                        (4)

       Similarly ⊢ ⌜(α)φ . (α)ψ .⊃ ψ⌝.                        (5)

       *100      ⊢ ⌜[2 . 3 .⊃:] (α)(φ . ψ) ⊃. (α)φ . (α)ψ⌝.   (6)

       *100      ⊢ ⌜[4 . 5 .⊃:] (α)φ . (α)ψ .⊃. φ . ψ⌝.       (7)

       (7), *117 ⊢ ⌜(α)φ . (α)ψ .⊃ (α)(φ . ψ)⌝.               (8)

       *100      ⊢ ⌜[6 . 8 .⊃] 140⌝.


*141. ⊢ ⌜(∃α)(φ ∨ ψ) ≡. (∃α)φ ∨ (∃α)ψ⌝.

Proof. *100, *123   ⊢ ⌜(α) ~ (φ ∨ ψ) ≡ (α)(~ φ . ~ ψ)

       *140                          ≡. (α) ~ φ . (α) ~ ψ⌝.   (1)

       *100 (& D8)  ⊢ ⌜[1 ⊃] 141⌝.

*140 shows that a universal quantifier can be distributed through 
a conjunction, and *141 shows that an existential quantifier can be 
distributed through an alternation. But distribution of a universal 
quantifier through an alternation is not in general valid, nor is
distribution of an existential quantifier through a conjunction. The
biconditionals:

(x)(x is red .∨ ~(x is red)) ≡. (x)(x is red) ∨ (x)~(x is red),
(∃x)(x is red . ~(x is red)) ≡. (∃x)(x is red) . (∃x)~(x is red),

e.g., are false, as can be seen by observing the truth of:

(x)(x is red .∨ ~(x is red)), (∃x)(x is red), (∃x)~(x is red)

and the falsehood of:

(∃x)(x is red . ~(x is red)), (x)(x is red), (x)~(x is red)

and then applying truth tables.
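
The truth values just cited can be recomputed on a toy universe. This is a sketch under stated assumptions: the three objects and the extension assigned to ‘is red’ are hypothetical, and the helper `distribution_laws_hold` merely surveys *140 and *141 over all pairs of predicates on a small universe rather than proving them.

```python
from itertools import product

DOM = ["apple", "sky", "rose"]                      # hypothetical universe
RED = {"apple": True, "sky": False, "rose": True}   # hypothetical extension of 'is red'

# The two sides of each biconditional rejected in the text.
univ_of_alt = all(RED[x] or not RED[x] for x in DOM)
alt_of_univ = all(RED[x] for x in DOM) or all(not RED[x] for x in DOM)
exist_of_conj = any(RED[x] and not RED[x] for x in DOM)
conj_of_exist = any(RED[x] for x in DOM) and any(not RED[x] for x in DOM)

def distribution_laws_hold(n):
    """Check *140 and *141 for every pair of predicates on an n-object universe."""
    rows = list(product([False, True], repeat=n))
    for f in rows:
        for g in rows:
            both = [a and b for a, b in zip(f, g)]
            either = [a or b for a, b in zip(f, g)]
            if all(both) != (all(f) and all(g)):     # *140
                return False
            if any(either) != (any(f) or any(g)):    # *141
                return False
    return True
```

The two sides of each rejected biconditional come out with opposite truth values here, while the two valid distribution laws survive the survey.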

*141 has the following corollary.¹

*142. ⊢ ⌜(∃α)(φ ⊃ ψ) ≡. (α)φ ⊃ (∃α)ψ⌝.

Proof. *141 (& D4, 8)  ⊢ ⌜(∃α)(φ ⊃ ψ) ≡. (α)~~φ ⊃ (∃α)ψ⌝

       *100, *123                     ≡. (α)φ ⊃ (∃α)ψ⌝.

The citation of D4 and D8 in this proof calls attention to the
fact that

    ⌜(∃α)(~φ ∨ ψ) ≡. (∃α)~φ ∨ (∃α)ψ⌝
and ⌜(∃α)(φ ⊃ ψ) ≡. (α)~~φ ⊃ (∃α)ψ⌝

are the same formula under different abbreviations.

*140 equates a quantified conjunction with a conjunction of 
quantifications; *141 does likewise with respect to alternation; 
and *142 does likewise with respect to the conditional. Various 
further principles of this kind also hold, but only when the
biconditional is weakened to a conditional in one direction or the
other. *101 is one such principle, and we now proceed to fourteen 
more. 

*143. ⊢ ⌜(α)φ ∨ (α)ψ .⊃ (α)(φ ∨ ψ)⌝.

Proof. *100       ⊢ ⌜φ ⊃. φ ∨ ψ⌝.                      (1)

       *101       ⊢ ⌜[(α)1 ⊃.] (α)φ ⊃ (α)(φ ∨ ψ)⌝.     (2)

       Similarly  ⊢ ⌜(α)ψ ⊃ (α)(φ ∨ ψ)⌝.               (3)

       *100       ⊢ ⌜[2 . 3 .⊃] 143⌝.

¹ This principle, though not so well known as others in this section, goes back to
Schröder (vol. 3, p. 30). But its analogue ⌜(α)(φ ⊃ ψ) ≡. (∃α)φ ⊃ (α)ψ⌝, also
given by Schröder (loc. cit.), is invalid. Half of it does hold; cf. *148.

*144. ⊢ ⌜(α)(φ ∨ ψ) ⊃. (∃α)φ ∨ (α)ψ⌝.

Proof. *101 (& D4, 8)  ⊢ ⌜(α)(~φ ⊃ ψ) ⊃. (∃α)φ ∨ (α)ψ⌝.   (1)

       *100, *123      ⊢ ⌜[1 ≡] 144⌝.

*145. ⊢ ⌜(α)(φ ∨ ψ) ⊃. (α)φ ∨ (∃α)ψ⌝.

Proof. *144        ⊢ ⌜(α)(ψ ∨ φ) ⊃. (∃α)ψ ∨ (α)φ⌝

       *100                      ⊃. (α)φ ∨ (∃α)ψ⌝.        (1)

       *100, *123  ⊢ ⌜[1 ≡] 145⌝.

*146. ⊢ ⌜(α)φ ∨ (∃α)ψ .⊃ (∃α)(φ ∨ ψ)⌝.

Proof. *100        ⊢ ⌜[136 ⊃:] (α)φ ∨ (∃α)ψ .⊃. (∃α)φ ∨ (∃α)ψ⌝.   (1)

       *141, *123  ⊢ ⌜146 [≡ 1]⌝.

*147. ⊢ ⌜(∃α)φ ∨ (α)ψ .⊃ (∃α)(φ ∨ ψ)⌝.  Proof similar.

*148. ⊢ ⌜(∃α)φ ⊃ (α)ψ .⊃ (α)(φ ⊃ ψ)⌝.

Proof. *100 (& D8)  ⊢ ⌜(∃α)φ ⊃ (α)ψ .⊃. (α)~φ ∨ (α)ψ⌝
       *143 (& D4)                    ⊃ (α)(φ ⊃ ψ)⌝.

*149. ⊢ ⌜(α)(φ ⊃ ψ) ⊃. (∃α)φ ⊃ (∃α)ψ⌝.

Proof. *145 (& D4)  ⊢ ⌜(α)(φ ⊃ ψ) ⊃. (α)~φ ∨ (∃α)ψ⌝
       *100 (& D8)                ⊃. (∃α)φ ⊃ (∃α)ψ⌝.

*150. ⊢ ⌜(∃α)φ ⊃ (∃α)ψ .⊃ (∃α)(φ ⊃ ψ)⌝.

Proof. *100 (& D8)  ⊢ ⌜(∃α)φ ⊃ (∃α)ψ .⊃. (α)~φ ∨ (∃α)ψ⌝
       *146 (& D4)                     ⊃ (∃α)(φ ⊃ ψ)⌝.

*151. ⊢ ⌜(α)φ ⊃ (α)ψ .⊃ (∃α)(φ ⊃ ψ)⌝.

Proof. *147 (& D4)        ⊢ ⌜(∃α)~φ ∨ (α)ψ .⊃ (∃α)(φ ⊃ ψ)⌝.   (1)

       *130, *123 (& D4)  ⊢ ⌜151 [≡ 1]⌝.

*152. ⊢ ⌜(α)(φ . ψ) ⊃. (∃α)φ . (α)ψ⌝.

Proof. *100        ⊢ ⌜[136 ⊃:] (α)φ . (α)ψ .⊃. (∃α)φ . (α)ψ⌝.   (1)

       *140, *123  ⊢ ⌜152 [≡ 1]⌝.

*153. ⊢ ⌜(α)(φ . ψ) ⊃. (α)φ . (∃α)ψ⌝.  Proof similar.

*154. ⊢ ⌜(α)φ . (∃α)ψ .⊃ (∃α)(φ . ψ)⌝.

Proof. *101         ⊢ ⌜(α)(φ ⊃ ~ψ) ⊃. (α)φ ⊃ (α)~ψ⌝.             (1)

       *100, *123   ⊢ ⌜[1 ≡:] (α)~(φ . ψ) ⊃. (α)φ ⊃ (α)~ψ⌝.      (2)
       *100 (& D8)  ⊢ ⌜[2 ⊃] 154⌝.

*155. ⊢ ⌜(∃α)φ . (α)ψ .⊃ (∃α)(φ . ψ)⌝.

Proof. *100        ⊢ ⌜(∃α)φ . (α)ψ .⊃. (α)ψ . (∃α)φ⌝

       *154                        ⊃ (∃α)(ψ . φ)⌝.       (1)

       *100, *123  ⊢ ⌜[1 ≡] 155⌝.

*156. ⊢ ⌜(∃α)(φ . ψ) ⊃. (∃α)φ . (∃α)ψ⌝.

Proof. *100       ⊢ ⌜φ . ψ .⊃ φ⌝.                          (1)

       *149       ⊢ ⌜[(α)1 ⊃.] (∃α)(φ . ψ) ⊃ (∃α)φ⌝.       (2)

       Similarly  ⊢ ⌜(∃α)(φ . ψ) ⊃ (∃α)ψ⌝.                 (3)

       *100       ⊢ ⌜[2 . 3 .⊃] 156⌝.



*143, *145, and *146 reveal a chain of four successively weaker
formulae, each of which implies its successor:

⌜(α)φ ∨ (α)ψ⌝, ⌜(α)(φ ∨ ψ)⌝, ⌜(α)φ ∨ (∃α)ψ⌝, ⌜(∃α)(φ ∨ ψ)⌝.

*143-*156 and *101 reveal also five other such chains, thus:

*143, *144, *147:

⌜(α)φ ∨ (α)ψ⌝, ⌜(α)(φ ∨ ψ)⌝, ⌜(∃α)φ ∨ (α)ψ⌝, ⌜(∃α)(φ ∨ ψ)⌝;

*148, *101, *151:

⌜(∃α)φ ⊃ (α)ψ⌝, ⌜(α)(φ ⊃ ψ)⌝, ⌜(α)φ ⊃ (α)ψ⌝, ⌜(∃α)(φ ⊃ ψ)⌝;

*148, *149, *150:

⌜(∃α)φ ⊃ (α)ψ⌝, ⌜(α)(φ ⊃ ψ)⌝, ⌜(∃α)φ ⊃ (∃α)ψ⌝, ⌜(∃α)(φ ⊃ ψ)⌝;

*153, *154, *156:

⌜(α)(φ . ψ)⌝, ⌜(α)φ . (∃α)ψ⌝, ⌜(∃α)(φ . ψ)⌝, ⌜(∃α)φ . (∃α)ψ⌝;

*152, *155, *156:

⌜(α)(φ . ψ)⌝, ⌜(∃α)φ . (α)ψ⌝, ⌜(∃α)(φ . ψ)⌝, ⌜(∃α)φ . (∃α)ψ⌝.
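
The six chains can be surveyed by brute force over a small non-empty universe. What follows is a hedged sketch only: the universe size `N`, the helper names, and the encoding of φ and ψ as tuples of truth values (one per object) are assumptions of the example, and passing the survey illustrates the chains without replacing the proofs above.

```python
from itertools import product

N = 3  # size of a hypothetical non-empty universe

def disj(f, g):
    return [a or b for a, b in zip(f, g)]

def conj(f, g):
    return [a and b for a, b in zip(f, g)]

def cond(f, g):
    return [(not a) or b for a, b in zip(f, g)]

A, E = all, any   # (α) and (∃α) over the finite universe

CHAINS = {
    "*143, *145, *146": [
        lambda f, g: A(f) or A(g),
        lambda f, g: A(disj(f, g)),
        lambda f, g: A(f) or E(g),
        lambda f, g: E(disj(f, g)),
    ],
    "*143, *144, *147": [
        lambda f, g: A(f) or A(g),
        lambda f, g: A(disj(f, g)),
        lambda f, g: E(f) or A(g),
        lambda f, g: E(disj(f, g)),
    ],
    "*148, *101, *151": [
        lambda f, g: (not E(f)) or A(g),
        lambda f, g: A(cond(f, g)),
        lambda f, g: (not A(f)) or A(g),
        lambda f, g: E(cond(f, g)),
    ],
    "*148, *149, *150": [
        lambda f, g: (not E(f)) or A(g),
        lambda f, g: A(cond(f, g)),
        lambda f, g: (not E(f)) or E(g),
        lambda f, g: E(cond(f, g)),
    ],
    "*153, *154, *156": [
        lambda f, g: A(conj(f, g)),
        lambda f, g: A(f) and E(g),
        lambda f, g: E(conj(f, g)),
        lambda f, g: E(f) and E(g),
    ],
    "*152, *155, *156": [
        lambda f, g: A(conj(f, g)),
        lambda f, g: E(f) and A(g),
        lambda f, g: E(conj(f, g)),
        lambda f, g: E(f) and E(g),
    ],
}

def chain_descends(chain, n=N):
    """True if each formula implies its successor for all predicates on n objects."""
    rows = list(product([False, True], repeat=n))
    for f in rows:
        for g in rows:
            values = [formula(f, g) for formula in chain]
            if any(a and not b for a, b in zip(values, values[1:])):
                return False
    return True

results = {name: chain_descends(chain) for name, chain in CHAINS.items()}
```

Reversing a chain makes the survey fail, which agrees with the remark that the formulae are successively weaker rather than equivalent.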

The next four metatheorems show that a quantifier covering a 
conjunction or alternation may equivalently be restricted to a 
single component in case the variable of quantification is not free 
in the other component. 

*157. If α is not free in φ, ⊢ ⌜(α)(φ . ψ) ≡. φ . (α)ψ⌝.

Proof. *118 (& hp), *123  ⊢ ⌜157 [≡ 140]⌝.

*158. If α is not free in φ, ⊢ ⌜(∃α)(φ . ψ) ≡. φ . (∃α)ψ⌝.

Proof.

*137 (& hp), *123  ⊢ ⌜(∃α)(φ . ψ) ⊃. φ . (∃α)ψ [:≡ 156]⌝.   (1)

*118 (& hp), *123  ⊢ ⌜φ . (∃α)ψ .⊃ (∃α)(φ . ψ) [:≡ 154]⌝.   (2)
*100               ⊢ ⌜[1 . 2 .⊃] 158⌝.

*159. If α is not free in φ, ⊢ ⌜(α)(φ ∨ ψ) ≡. φ ∨ (α)ψ⌝.

Proof.

*137 (& hp), *123  ⊢ ⌜(α)(φ ∨ ψ) ⊃. φ ∨ (α)ψ [:≡ 144]⌝.     (1)

*118 (& hp), *123  ⊢ ⌜φ ∨ (α)ψ .⊃ (α)(φ ∨ ψ) [:≡ 143]⌝.     (2)

*100               ⊢ ⌜[1 . 2 .⊃] 159⌝.

*160. If α is not free in φ, ⊢ ⌜(∃α)(φ ∨ ψ) ≡. φ ∨ (∃α)ψ⌝.

Proof. *137 (& hp), *123  ⊢ ⌜160 [≡ 141]⌝.

In like fashion a quantifier covering a conditional may be
restricted to the consequent if the variable of quantification is not
free in the antecedent. This is already provided by *159 and *160, 
in view of D4. But a curious twist appears in the opposite case, 
where the variable of quantification is not free in the consequent: 
to confine the quantifier to the antecedent we must change it from 
universal to existential or vice versa. 

*161. If α is not free in ψ, ⊢ ⌜(α)(φ ⊃ ψ) ≡. (∃α)φ ⊃ ψ⌝.

Proof.

*137 (& hp), *123  ⊢ ⌜(α)(φ ⊃ ψ) ⊃. (∃α)φ ⊃ ψ [:≡ 149]⌝.   (1)
*118 (& hp), *123  ⊢ ⌜(∃α)φ ⊃ ψ .⊃ (α)(φ ⊃ ψ) [:≡ 148]⌝.   (2)
*100               ⊢ ⌜[1 . 2 .⊃] 161⌝.

*162. If α is not free in ψ, ⊢ ⌜(∃α)(φ ⊃ ψ) ≡. (α)φ ⊃ ψ⌝.

Proof. *137 (& hp), *123  ⊢ ⌜162 [≡ 142]⌝.
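
The confinement principles *157-*162, including the change of quantifier in *161 and *162, can be checked on small universes. This is a sketch under assumptions of the example: the component in which α is not free is modelled as a single fixed truth value `p`, the other component as a tuple `g` of truth values (one per object), and the function name `confinement_laws_hold` is hypothetical.

```python
from itertools import product

def confinement_laws_hold(n):
    """Check *157-*162 on an n-object universe (n at least 1).

    `p` plays the component in which α is not free (a fixed truth value);
    `g` lists the truth values of the other component, one per object.
    """
    for p in (False, True):
        for g in product([False, True], repeat=n):
            laws = [
                all(p and v for v in g) == (p and all(g)),            # *157
                any(p and v for v in g) == (p and any(g)),            # *158
                all(p or v for v in g) == (p or all(g)),              # *159
                any(p or v for v in g) == (p or any(g)),              # *160
                all((not v) or p for v in g) == ((not any(g)) or p),  # *161
                any((not v) or p for v in g) == ((not all(g)) or p),  # *162
            ]
            if not all(laws):
                return False
    return True
```

In the lines for *161 and *162 the quantifier on the left (`all`, `any`) is matched on the right by the opposite one (`any`, `all`), which is the twist described in the text.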

The following corollary of *161 proves useful. 

*163. If ⊢ ⌜φ ⊃ ψ⌝ and none of α₁, ... , αₙ is free in ψ, then
⊢ ⌜(∃α₁)...(∃αₙ)φ ⊃ ψ⌝.

Proof. hp 1          ⊢ ⌜φ ⊃ ψ⌝.                              (1)

       *161 (& hp 2)  ⊢ ⌜[(αₙ)1 ≡.] (∃αₙ)φ ⊃ ψ⌝.             (2)

       *161 (& hp 2)  ⊢ ⌜[(αₙ₋₁)2 ≡.] (∃αₙ₋₁)(∃αₙ)φ ⊃ ψ⌝,
and so on.

§ 21. Alphabetic Variance


VARIABLES, as remarked (§ 12), serve merely for cross-reference
to various positions of quantification. The particular choice of
letters which happens to be made, in constructing a statement, is
immaterial to the meaning so long as the system of cross-references
remains the same. The statements:

(1) (x)(y)(x < y .⊃ (z)(z < x .⊃. z < y)),

(2) (x)(y)(x < y .⊃ (w)(w < x .⊃. w < y)),

(3) (x)(z)(x < z .⊃ (w)(w < x .⊃. w < z)),

(4) (y)(z)(y < z .⊃ (w)(w < y .⊃. w < z)),

e.g., are equivalent; they differ from one another only in an
accidental detail of notation. If instead of using variables we were to
indicate the cross-references by the method of curved lines (cf.
§ 12), differences of this sort would drop out altogether. The
development of a technique for recognizing such purely alphabetic
variants as (1)-(4), and interchanging them at will, is the price
which we must pay for a system of notation which is in most
respects more convenient than its known alternatives. The present
section will be devoted to paying this price.

Each of the statements (1)-(4) is transformed into the next by
relettering a constituent quantification ⌜(α)ψ⌝; i.e., by replacing
⌜(α)ψ⌝ by ⌜(α′)ψ′⌝,¹ where ψ′ is like ψ except for exhibiting α′
in place of α. But such relettering cannot always be depended
upon to preserve equivalence. The rewriting of ‘z’ in (1) as ‘w’
produces (2), which is indeed equivalent; the rewriting of ‘z’ as
‘y’, however, would have produced:

(5) (x)(y)(x < y .⊃ (y)(y < x .⊃. y < y)),

which, far from being equivalent to the truths (1) and (2), is false.
The relettering which leads from (1) to (5) alters the pattern of
cross-references; the last occurrence of ‘y’ in (5) refers back to the
last quantifier occurrence in (5), whereas the correspondingly
placed occurrence of ‘y’ in (1) refers back rather to the second
quantifier occurrence in (1).

¹ Strictly speaking the conventions of quasi-quotation fall into ambiguity at this
point, because of the fact that Greek and Latin letters are both eligible to
accentuation. Is the quantifier in ⌜(α′)ψ′⌝ to be understood as the result of putting the
variable α′ (e.g. ‘y’) in the blank of ‘( )’, or is it to be understood rather as the
result of putting the variable α (e.g. ‘x’) in the blank of ‘( ′)’? The former,
obviously, is the intention. In general, let us understand accents in
quasi-quotation as belonging thus always to the Greek letters rather than to the contextual
framework.

In general, if the replacement of ⌜(α)ψ⌝ by ⌜(α′)ψ′⌝ is not
to affect the cross-references, the free occurrences of α′ in ψ′
must exactly match the free occurrences of α in ψ; for it is just
those occurrences that refer back to the initial occurrence of the
quantifier in ⌜(α)ψ⌝ or ⌜(α′)ψ′⌝ (cf. (ii), § 14). Quantifications
⌜(α)ψ⌝ and ⌜(α′)ψ′⌝ which are so related will be called immediate
alphabetic variants of each other. In full: ⌜(α)ψ⌝ and ⌜(α′)ψ′⌝ are
immediate alphabetic variants if ψ and ψ′ are alike except that ψ′
contains free occurrences of α′ at all and only those places where ψ
contains free occurrences of α. Thus the quantification:

(w)(w < x .⊃. w < y),

which appears in (2), is an immediate alphabetic variant of the
quantification:

(z)(z < x .⊃. z < y)

of (1); on the other hand this is not true of the quantification:

(y)(y < x .⊃. y < y)

which appears in (5).
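
The definition of immediate alphabetic variants lends itself to a mechanical test. The following is a minimal sketch under assumptions of the example, not a rendering of anything in the formal system: formulae are encoded as nested tuples with the hypothetical tags `"lt"`, `"imp"`, and `"all"`, and the sentinel `HOLE` is assumed never to occur as a variable. Free occurrences of the quantified variable are blanked out on both sides, and the two results are then compared.

```python
HOLE = "#"  # sentinel, assumed never to be used as a variable

def mark_free(formula, var):
    """Copy `formula`, replacing every free occurrence of `var` by HOLE."""
    tag = formula[0]
    if tag == "lt":                       # atomic: ("lt", left, right)
        return (tag,) + tuple(HOLE if v == var else v for v in formula[1:])
    if tag == "imp":                      # conditional: ("imp", antecedent, consequent)
        return (tag, mark_free(formula[1], var), mark_free(formula[2], var))
    if tag == "all":                      # quantification: ("all", variable, body)
        _, bound, body = formula
        if bound == var:                  # occurrences inside are bound, not free
            return formula
        return (tag, bound, mark_free(body, var))
    raise ValueError(f"unknown formula tag: {tag!r}")

def immediate_variants(q1, q2):
    """True if the quantifications q1 and q2 are immediate alphabetic variants."""
    if q1[0] != "all" or q2[0] != "all":
        return False
    _, a, psi = q1
    _, a_prime, psi_prime = q2
    return mark_free(psi, a) == mark_free(psi_prime, a_prime)

def body(v):
    """(v < x .⊃. v < y) with the letter v supplied."""
    return ("imp", ("lt", v, "x"), ("lt", v, "y"))

Q_Z = ("all", "z", body("z"))   # the quantification of (1)
Q_W = ("all", "w", body("w"))   # the quantification of (2)
Q_Y = ("all", "y", body("y"))   # the quantification of (5)
```

Here `immediate_variants(Q_Z, Q_W)` succeeds, whereas `immediate_variants(Q_Z, Q_Y)` fails because ‘y’ already occurs free in the body of the original quantification.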

Now the kind of relettering which leads from (1) to (2), from
(2) to (3), and from (3) to (4) can be described as follows: it
consists in replacing a constituent quantification by an immediate
alphabetic variant thereof. Formulae which are intertransformable
by one or more such steps will be called, in general, alphabetic
variants. Thus (1)-(4) are all alphabetic variants of one another;
(5), on the other hand, is an alphabetic variant of none of them.
The rigorous formulation is this: formulae φ and φ′ are alphabetic
variants if there are formulae φ₀, φ₁, ... φₙ such that φ₀ is φ, φₙ is φ′,
and, for each i from 1 to n, φᵢ is formed from φᵢ₋₁ by replacing some
constituent quantification ⌜(αᵢ)ψᵢ⌝ by an immediate alphabetic
variant ⌜(α′ᵢ)ψ′ᵢ⌝ thereof.

In the above explanation it was not required that φ and φ′ be
statements; the notion of alphabetic variant applies to statements
and matrices indifferently. The relettering involved in passing to
an alphabetic variant of φ is always, however, a relettering of
occurrences which are bound in φ; we rewrite the variable within
a quantifier and at such subsequent occurrences as refer back to
that position of quantification. Thus matrices φ and φ′ are not
classed as alphabetic variants if they differ from each other in
point of free occurrences of variables. This is as it should be; the
particular choice of letters is indeed immaterial to a statement so
long as a certain general structure is maintained, but this is not
true of the free variables of a matrix. The matrices ‘y = x’ and
‘y = z’ actually differ in meaning, in this important sense:
replacement of the one by the other can turn a true context into a
falsehood. It turns the truth:

(6) (x)(y)(z)(x = y . y = z .⊃. x = z),

e.g., into the falsehood:

(x)(y)(z)(x = y . y = x .⊃. x = z).

A matrix φ (e.g. ‘y = z’) is a fragment of one or another statement ψ
(e.g. (6)), and any free variable α of φ is indeed a bound variable
of ψ. But any rewriting of α which preserves the meaning of ψ
must touch occurrences of α outside φ; the fragment φ itself and
the relettered fragment φ′ (e.g. ‘y = x’) are not interchangeable.

Where φ is a statement and φ′ is an alphabetic variant thereof,
the biconditional ⌜φ ≡ φ′⌝ will of course be trivially true.
Moreover, it will be a theorem. This will now be proved, and more:
where φ is a matrix rather than a statement, the closure of the
biconditional is still a theorem.

First we attend to the case of immediate alphabetic variants:

*170. If ⌜(α)ψ⌝ and ⌜(α′)ψ′⌝ are immediate alphabetic variants,
⊢ ⌜(α)ψ ≡ (α′)ψ′⌝.

Proof. By hp, ψ and ψ′ are alike except that ψ′ contains free
occurrences of α′ at all and only those places where ψ contains free
occurrences of α. Hence, by *103 and *117,

       ⊢ ⌜(α)ψ ⊃ (α′)ψ′⌝,          (1)

       ⊢ ⌜(α′)ψ′ ⊃ (α)ψ⌝.          (2)

*100   ⊢ ⌜[1 . 2 .⊃] 170⌝.

The extension to the case of alphabetic variants in general is
then obvious:

*171. If φ and φ′ are alphabetic variants, ⊢ ⌜φ ≡ φ′⌝.

Proof. By hp, there are formulae φ₀, φ₁, ... φₙ such that φ₀
is φ, φₙ is φ′, and, for each i from 1 to n, φᵢ is formed from φᵢ₋₁ by
replacing a quantification ⌜(αᵢ)ψᵢ⌝ by an immediate alphabetic
variant ⌜(α′ᵢ)ψ′ᵢ⌝ thereof. For each i, by *170,
       ⊢ ⌜(αᵢ)ψᵢ ≡ (α′ᵢ)ψ′ᵢ⌝,
and hence, by *123, ⊢ ⌜φᵢ₋₁ ≡ φᵢ⌝. Thus

       ⊢ ⌜φ₀ ≡ φ₁⌝
             ≡ φ₂⌝
             . . .
             ≡ φₙ⌝, q. e. d.

The transformation of a formula φ into an alphabetic variant φ′
runs through a series of stages φ, φ₁, φ₂, ... φₙ₋₁, φ′. Each
component step of transformation consists in supplanting some
quantification ⌜(αᵢ)ψᵢ⌝ by an immediate alphabetic variant
⌜(α′ᵢ)ψ′ᵢ⌝ thereof. The end result φ′ differs from φ with respect to
many variables, in general, and φ₁, φ₂, ... φₙ₋₁ are the
intermediate results of making the alphabetic changes one at a time.

It might be supposed, then, given φ and φ′, that we can always
supply the intervening stages φ₁, φ₂, ... , φₙ₋₁ by the simple process
of putting the new letters for the old ones one at a time in an
arbitrary order, say in the order of occurrence of the quantifiers
concerned. Actually the matter is not so simple. Let us return to
the examples (1)-(4) at the beginning of the section. (1) and (4)
are known to be alphabetic variants; and (4) differs from (1) in
exhibiting ‘y’, ‘z’, and ‘w’ instead respectively of ‘x’, ‘y’, and
‘z’. Yet we cannot get from (1) to (4) by first putting ‘y’ for ‘x’,
then ‘z’ for ‘y’, and then ‘w’ for ‘z’. The first of these steps leads
from (1) to:

(y)(y)(y < y .⊃ (z)(z < y .⊃. z < y)),

which is in fact not an alphabetic variant of (1); and if we put ‘z’
for ‘y’ in this result and ‘w’ for ‘z’ in the new result we end up not
with (4) but with the monotonous truism:

(w)(w)(w < w .⊃ (w)(w < w .⊃. w < w)).

That (1) and (4) are alphabetic variants is shown rather by the
sequence (1), (2), (3), (4): the sequence obtained by rewriting
‘z’ first as ‘w’, then ‘y’ as ‘z’, then ‘x’ as ‘y’.

Again, consider the following pair of alphabetic variants: 

(7) (x)(x is a number .⊃ ~(y)(x = x + y)),

(8) (y)(y is a number .⊃ ~(x)(y = y + x)).

(8) differs from (7) in exhibiting ‘y’ and ‘x’ instead respectively
of ‘x’ and ‘y’; yet an appropriate sequence φ, φ₁, ... φₙ₋₁, φ′
joining (7) and (8) can be found neither by first changing ‘x’ to ‘y’
and then changing ‘y’ to ‘x’, nor by following the reverse order.
The desired connection between (7) and (8) can be made only
through intervening stages such as the following, which involve
temporary introduction of a new letter ‘z’:

(9) (z)(z is a number .⊃ ~(y)(z = z + y)),

(10) (z)(z is a number .⊃ ~(x)(z = z + x)).

The statements (7), (9), (10), (8) do constitute a sequence φ, φ₁,
... , φₙ₋₁, φ′ of the appropriate kind.

Returning to the general case, let φ′ be an alphabetic variant of
φ containing α′₁, α′₂, ... α′ₖ in place respectively of α₁, α₂, ... αₖ.
We see that the connecting links φ₁, φ₂, etc. which establish
alphabetic variance cannot in general be found by the simple expedient
of putting α′₁ for α₁ in φ, then α′₂ for α₂ in the result, and so on.
This expedient has been seen to fail where φ is (1) and φ′ is (4), and
again where φ is (7) and φ′ is (8). Any number of other examples to
the same purpose could be constructed, sharing always this feature:
various of α₁, α₂, ... αₖ reappear among α′₁, α′₂, ... α′ₖ.

A method is apparent, however, which reveals suitable links
φ₁, φ₂, etc. in the general case. It turns on temporary introduction
of new letters as in (9) and (10). Where β₁, β₂, ... βₖ are any
variables which are distinct from one another and from all variables of
φ and φ′, construct the sequence φ₁, φ₂, ... φₖ by putting β₁ for α₁
in φ, β₂ for α₂ in the result, and so on; then form the continuation
φₖ₊₁, φₖ₊₂, ... φ₂ₖ₋₁, φ′ by putting α′₁ for β₁ in φₖ, α′₂ for β₂ in the
result, and so on. The total sequence φ, φ₁, φ₂, ... , φ₂ₖ₋₁, φ′ does
establish alphabetic variance of φ and φ′. Where φ and φ′ are (1)
and (4), e.g., and β₁ is chosen as ‘x′’, β₂ as ‘y′’, and β₃ as ‘z′’, the
described method supplies φ₁, φ₂, φ₃, φ₄, and φ₅ as follows:

(11) (x′)(y)(x′ < y .⊃ (z)(z < x′ .⊃. z < y)),

(12) (x′)(y′)(x′ < y′ .⊃ (z)(z < x′ .⊃. z < y′)),

(13) (x′)(y′)(x′ < y′ .⊃ (z′)(z′ < x′ .⊃. z′ < y′)),

(14) (y)(y′)(y < y′ .⊃ (z′)(z′ < y .⊃. z′ < y′)),

(15) (y)(z)(y < z .⊃ (z′)(z′ < y .⊃. z′ < z)).

Each succeeding formula of the sequence (1), (11), (12), (13), (14), 
(15), (4) differs from its predecessor merely through replacement of 
a constituent quantification by an immediate alphabetic variant 
thereof; the alphabetic variance of (1) and (4) is thus verified. 
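
The general method can be traced mechanically for the pair (1) and (4). The following is an illustrative sketch only, with several assumptions of its own: formulae are nested tuples tagged `"lt"`, `"imp"`, and `"all"`; a quantification is located by a path of tuple indices; `HOLE` is a sentinel assumed never to occur as a variable; and every helper name is hypothetical. Each relettering step is accepted only if it replaces a quantification by an immediate alphabetic variant.

```python
HOLE = "#"  # sentinel, assumed never to be used as a variable

def subst_free(f, old, new):
    """Copy f, putting `new` for every free occurrence of `old`."""
    tag = f[0]
    if tag == "lt":
        return (tag,) + tuple(new if v == old else v for v in f[1:])
    if tag == "imp":
        return (tag, subst_free(f[1], old, new), subst_free(f[2], old, new))
    if tag == "all":
        return f if f[1] == old else (tag, f[1], subst_free(f[2], old, new))
    raise ValueError(f"unknown formula tag: {tag!r}")

def immediate_variants(q1, q2):
    """Blank out the free occurrences of each quantified variable and compare."""
    return subst_free(q1[2], q1[1], HOLE) == subst_free(q2[2], q2[1], HOLE)

def reletter_at(f, path, new):
    """Reletter the quantification at `path`; None if the step is illegitimate."""
    if path:
        i = path[0]
        inner = reletter_at(f[i], path[1:], new)
        return None if inner is None else f[:i] + (inner,) + f[i + 1:]
    if f[0] != "all":
        raise ValueError("path does not lead to a quantification")
    candidate = ("all", new, subst_free(f[2], f[1], new))
    return candidate if immediate_variants(f, candidate) else None

def reletter_in_turn(f, paths, letters):
    """Apply the reletterings one at a time, stopping at the first failure."""
    for path, new in zip(paths, letters):
        f = reletter_at(f, path, new)
        if f is None:
            return None
    return f

def statement(x, y, z):
    """(x)(y)(x < y .⊃ (z)(z < x .⊃. z < y)) with the three letters supplied."""
    inner = ("all", z, ("imp", ("lt", z, x), ("lt", z, y)))
    return ("all", x, ("all", y, ("imp", ("lt", x, y), inner)))

S1 = statement("x", "y", "z")          # statement (1)
S4 = statement("y", "z", "w")          # statement (4)
PATHS = [(), (2,), (2, 2, 2)]          # the three quantifiers, outermost first

naive = reletter_in_turn(S1, PATHS, ["y", "z", "w"])        # fails at once
detour = reletter_in_turn(S1, PATHS, ["x'", "y'", "z'"])    # stages (11)-(13)
via_fresh = reletter_in_turn(detour, PATHS, ["y", "z", "w"])  # (14), (15), (4)
```

The naive order is rejected at its first step, while the detour through fresh letters passes every check and ends in (4).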

This general method delivers somewhat longer sequences than
necessary. We know, e.g., that the sequence (1), (2), (3), (4)
would suffice in place of the sequence (1), (11)-(15), (4). But the
method has the virtue of simplicity in general formulation. Its
existence will excuse us, in future pages, from citing intervening
formulae φ₁, φ₂, etc. by way of establishing any alleged case of
alphabetic variance. The reader can be left always to check the
matter for himself, by the general method if not by inspection.

The discussion of alphabetic variance has been limited to
variables governed by universal quantifiers, but it carries over
automatically to variables governed by existential quantifiers; for an
existential quantifier is merely shorthand for a universal quantifier
with a couple of denial signs.


§ 22. Class and Member


IN CHAPTER I connectives were studied which answer to the
statement connectives ‘neither-nor’, ‘if-then’, ‘and’, etc. of
ordinary discourse. In Chapter II the essential notions were
formulated which underlie the common use of the words
‘whatever’, ‘all’, ‘there is’, ‘some’. We now turn to a connective
which embodies the principal meaning of the ambiguous word
‘is’.¹

Sometimes ‘is’ has the sense of ‘=’, or ‘is the same as’; such is
its sense in ‘Paris is the capital of France’, ‘Tully is Cicero’.
But in ‘Paris is a city’ or ‘Tully is wise’ or ‘Socrates is wise’ the
word cannot be so construed; from ‘Tully = wise’ and ‘Socrates
= wise’, indeed, we could infer that Tully = Socrates. In such
contexts ‘is’ expresses rather possession of a property, or
membership in a class: Paris belongs to the class of cities, and Socrates
belongs to the class of wise beings. It is this sense of ‘is’ that is
rendered symbolically by the connective ‘ε’: ‘Paris ε city’,
‘Socrates ε wise’. The connective ‘ε’ is (like ‘=’, ‘hates’, etc.) a
binary predicate (cf. § 5); whereas ‘↓’, ‘⊃’, etc. yield
statements when put between statements, ‘ε’ yields statements when
put between names.

In the examples which have been cited for one purpose or
another in the course of foregoing sections, the names which figure
most prominently are ‘Jones’, ‘Smith’, ‘Socrates’, ‘Boston’,
‘Paris’. But the names which ‘ε’ is used to connect are not
exclusively of this sort. What is ordinarily wanted to the right of ‘ε’
is a general term, e.g. ‘wise’ or ‘city’, rather than the name of
some specific man or place. It is convenient, however, to regard
such general terms as names on the same footing as ‘Socrates’ and
‘Paris’: names each of a single specific entity, though a less
tangible entity than the man Socrates or the town Boston.² The
word ‘wise’ may be treated as a name of the class of wise beings,
taken as a single object of an abstract kind; and ‘city’ may be
treated as a name of the class of cities. The statement ‘Socrates ε
wise’, now, says something about two objects, a man and a class;
namely it says that the one is a member of the other.
Correspondingly for ‘Paris ε city’.

¹ The notation ‘ε’, short for ‘ἐστί’, is Peano’s.

² For a divergent, more philosophical account, see my Methods, §§ 34, 38.

The reassuring phrase ‘mere aggregates’ must be received 
warily as a description of classes. Aggregates, perhaps; but not 
in the sense of composite concrete objects or heaps. Continental 
United States is an extensive physical body (of arbitrary depth) 
having the several states as parts; at the same time it is a physical 
body having the several counties as parts. It is the same concrete 
object, regardless of the conceptual dissections imposed; the heap 
of states and the heap of counties are identical. The class of states, 
however, cannot be identified with the class of counties; for there 
is much that we want to affirm of the one class and deny of the 
other. We want to say e.g. that the one class has exactly 48 
members, while the other has 3075. We want to say that Delaware 
is a member of the first class and not of the second, and that 
Nantucket is a member of the second class and not of the first. 
These classes, unlike the single concrete heaps which their members 
compose, must be accepted as two entities of a non-spatial and 
abstract kind. 

Once classes are freed thus of any deceptive hint of tangibility, 
there is little reason to distinguish them from properties. It
matters little whether we read ‘x ε y’ as ‘x is a member of the class y’
or ‘x has the property y’. If there is any difference between classes
and properties, it is merely this: classes are the same when their 
members are the same, whereas it is not universally conceded that 
properties are the same when possessed by the same objects. The 
class of all marine mammals living in 1940 is the same as the class 
of all whales and porpoises living in 1940, whereas the property of 
being a marine mammal alive in 1940 might be regarded as differing 
from the property of being a whale or porpoise alive in 1940. But 
classes may be thought of as properties if the latter notion is so 
qualified that properties become identical when their instances are 
identical. Classes may be thought of as properties in abstraction 
from any differences which are not reflected in differences of
instances. For mathematics certainly, and perhaps for discourse
generally, there is no need of countenancing properties in any other 
sense. 

Discourse in general, mathematical and otherwise, involves
continual reference to abstract entities of this sort: classes or
properties. One may prefer to regard abstractions as fictions or
manners of speaking; one may hope to find a method whereby all
ostensible reference to abstract entities can be explained as mere
shorthand for a more basic idiom involving reference only to
concrete objects (in some sense or other).¹ Such a nominalistic
program presents extreme difficulty, if much of standard
mathematics and natural science is to be really analyzed and reduced
rather than merely repudiated; however, it is not known to be
impossible. If a nominalistic theory of this sort should be achieved,
we may gladly accept it as the theoretical underpinning of our
present ostensible reference to so-called abstract entities;
meanwhile, however, we have no choice but to admit those abstract
entities as part of our ultimate subject matter.

We are thus to recognize as names of entities not only such
expressions as ‘Nantucket’, ‘the capital of France’, ‘the
northernmost chimney of Craigie House’, but also general terms, conceived
as naming abstract classes. And when through the medium of
quantification we affirm something as true of every object x, we
mean it to apply not merely to every spatially extended concrete
object x, but to every object, abstract or concrete, class or
individual.

Our working ontology is thus pretty liberal. But in mitigation 
it may now be said that this is the end; no abstract objects other 
than classes are needed — no relations, functions, numbers, etc., 
except insofar as these are construed simply as classes. In addition 
to concrete objects we need recognize only classes having such
objects as members, then classes whose members are drawn from
the thus supplemented totality, and so on. This is presumably all
the ontology that is needed for discourse in general; certainly it is
all that is needed for mathematics.

¹ See my “Designation and Existence,” “Logistical Approach,” and “Theory
of Classes.” Note that Russell’s contextual definition of class names
(“Mathematical Logic”; also Principia, vol. 1, pp. 71 ff, 187 ff) does not dispense with
abstract entities, but only eliminates so-called classes in favor of properties. This
point is obscured by ambiguity of the phrase ‘propositional function’, which
sometimes means ‘property’ and sometimes ‘statement matrix’.

It is by no means clear what objects are to be regarded as
concrete. E.g., should we regard men as concrete objects, or should we
regard events as concrete objects and then explain men as classes
of events? For present purposes it does not matter, for there will
be no occasion in the formal developments to speak specifically
of concrete objects. It would even be possible, compatibly with
the projected formal developments and indeed with the whole of
mathematics, to repudiate concrete objects altogether: to
recognize just classes, each of which has classes in turn as members or
else no members whatever. Such an ontology would recognize an
empty or null class, a class whose sole member is the null class, a
class whose sole member is the latter, and so on; also, classes whose
members are variously chosen from the latter series; also, classes
whose members are variously chosen from the thus supplemented
totality; and so on.¹ This exclusively abstract ontology has little
naturalness to recommend it, but there is no need here to reject
or accept it.

Where y is a class, ‘x ε y’ means that x is a member of it. But
what does it mean where y is not a class? We are free to decide
this question as we like, by arbitrary supplementary interpretation
of the sign ‘ε’. The decision which seems to offer the greatest
simplicity, as measured by the general principles produced, is this:
where y is not a class, ‘x ε y’ means ‘x = y’. Hence ‘x ε y’,
briefly read ‘x is a member of y’, is to have as its full translation the
following: ‘x is a member of y or is the same as y according as y is
or is not a class’. We may continue to use the brief reading if we
agree that ‘is a member of y’ is to mean ‘is the same as y’ when
y is not a class.

The meaning of ‘ε’ reduces to that of ‘=’ just in the extreme
case where the second object involved is not a class. Thus it does
not follow that ‘ε’ will in general serve the purposes of ‘=’; the
latter connective continues to be needed to carry the meaning ‘is
the same as’ as applied to classes and non-classes generally. It
does turn out, however, that ‘=’ is dispensable as a primitive
connective; it can be defined in terms of ‘ε’, ‘↓’, and
quantification (cf. § 25).

¹ Cf. Fraenkel, “Untersuchungen,” pp. 255f. The essential principle of this
ontology might, in view of the developments which are immediately to follow, be
given the symbolic rendering ‘(y)~(x)(x ε y .≡. x = y)’, though this version is
not suited to Fraenkel’s own system.

Note further that the notion of class itself is not needed as a
separate assumption; ‘y is a class’ can be paraphrased in terms
exclusively of ‘ε’, ‘↓’, and quantification. For, under the
supplementary interpretation of ‘ε’ just now adopted, the distinguishing
feature of a non-class y is this: ‘x ε y’ amounts, for every object
x, to ‘x = y’. Hence, once having defined ‘=’ in terms of ‘ε’,
‘↓’, and quantification, we can paraphrase ‘y is not a class’ as:

(x)(x ε y .≡. x = y)

and ‘y is a class’ as:

~(x)(x ε y .≡. x = y).
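
The supplementary interpretation of ‘ε’ and the paraphrase of ‘y is not a class’ can be tried out on a toy universe. This is a sketch resting on assumptions that are not in the text: classes are modelled as Python frozensets, non-classes as strings, the names `member`, `is_non_class`, and `UNIVERSE` are hypothetical, and the quantifier is taken to range over the listed objects only. Since a frozenset cannot contain itself, the case of a class whose sole member is itself does not arise in this model.

```python
def member(x, y):
    """'x ε y': membership if y is a class (a frozenset), identity otherwise."""
    return x in y if isinstance(y, frozenset) else x == y

def is_non_class(y, universe):
    """The paraphrase (x)(x ε y .≡. x = y), evaluated over a finite universe."""
    return all(member(x, y) == (x == y) for x in universe)

NULL = frozenset()                       # the null class
UNIVERSE = [
    "Socrates", "Paris",                 # hypothetical non-classes
    NULL,
    frozenset({NULL}),                   # class whose sole member is the null class
    frozenset({"Socrates"}),
    frozenset({"Socrates", "Paris", NULL}),
]

# Which objects the paraphrase singles out as non-classes.
non_classes = [y for y in UNIVERSE if is_non_class(y, UNIVERSE)]
```

On this universe the paraphrase picks out exactly the two strings, agreeing with the built-in distinction between frozensets and other objects.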

But none of these considerations, having to do with the
interpretation of ‘x ε y’ in cases where y is not a class, will figure
explicitly in the formal developments which are to follow. Situations
will never arise where an object y is known to be a non-class. As
mentioned, the formal developments are compatible with the
non-existence of entities other than classes; and under this extreme
alternative the adopted supplementary interpretation of ‘ε’
evaporates altogether.


§ 23. Logical Formulae

THE CONNECTIVE of membership is adopted as part of our
primitive logical notation, along with the connective ‘↓’ of joint
denial and the quantifiers and variables which constitute the
notation of quantification. By putting ‘ε’ between variables, we
obtain formulae ⌜α ε β⌝ (e.g. ‘x ε x’, ‘x ε y’, ‘y ε z’) which are
atomic in the sense of having no other formulae as parts. This is
the first time atomic formulae have come to hand. Hitherto,
though formulae in general were described as built up of atomic
formulae by joint denial and quantifiers, the atomic formulae
themselves were left unspecified (cf. § 13).

Strictly, these newly acquired atomic formulae should be
described not as ⌜α ε β⌝ but as ⌜(α ε β)⌝; for ‘ε’, like all binary
connectives, is to be construed as having a pair of parentheses
associated with it. In practice, under the working conventions
explained in § 7, ⌜(α ε β)⌝ loses its parentheses when it stands alone
or as a component of a conjunction, alternation, conditional, or
biconditional.¹

Hitherto no actual expressions could be cited as formulae,
because of the fact that the atomic formulae were not given. Now,
however, one important sort of formulae is definitely fixed; it
comprises atomic formulae of the kind ⌜(α ε β)⌝ together with all
other formulae whose atomic parts are of this kind. Such formulae
will be spoken of henceforward as logical formulae. Thus the
logical formulae comprise, first, all expressions obtainable by
putting variables ‘x’, ‘y’, etc. in the blanks of ‘(  ε  )’;
second, all expressions obtainable by putting expressions of the
first totality in the blanks of ‘(  ↓  )’ or after a parenthesized
variable; third, all expressions obtainable by putting expressions of
the thus supplemented totality in the blanks of ‘(  ↓  )’ or
after a parenthesized variable; and so on. Synoptically: ⌜(α ε β)⌝
is a logical formula, and if φ and ψ are logical formulae then so are
⌜(φ ↓ ψ)⌝ and ⌜(α)φ⌝.

Thus the logical formulae comprise such expressions as:

(x ε x), (x ε y), (z ε w), (x)(x ε x), (x)(x ε y),
(y)(x ε y), (x)(y)(x ε y), ((x ε x) ↓ (x ε x)), ((x ε x) ↓ (x ε y)),
((x ε y) ↓ (y ε x)), ((x ε y) ↓ (z ε w)), (x)((x ε x) ↓ (x ε x)),
((x)(x ε x) ↓ (y)(x ε y)), (w)(x)(((x ε x) ↓ (y)(x ε y)) ↓ (x ε w)),

1 Actually the notation ⌜α ε β⌝ could wholly supplant ⌜(α ε β)⌝ at this stage,
even in the absence of special conventions such as those of § 7, without danger
of ambiguity. Coupled with notations subsequently to be introduced by definition,
however, this course would result in ambiguity, as my student K. R. Symon has
noted. Parentheses are needed e.g. to distinguish € 5)^ from r a$(y<t> e 5)^ 

(cf. p. 202).

etc. Such of the logical formulae as have no free variables (e.g.
the fourth, seventh, twelfth, and last of the above list) constitute
the logical statements, true or false. These four examples are, as it
happens, all false.
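
The formation rules and the notion of a logical statement can be sketched in code. The following is only an illustrative model, not part of the formal system: the class names `Member`, `JointDenial`, `ForAll` and the helpers `show`, `free_vars`, `is_statement` are assumptions introduced here. It builds the fourteen sample formulae listed above and picks out those with no free variables.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Member:                 # atomic formula (α ε β)
    left: str
    right: str


@dataclass(frozen=True)
class JointDenial:            # joint denial (φ ↓ ψ)
    left: "Formula"
    right: "Formula"


@dataclass(frozen=True)
class ForAll:                 # quantification (α)φ
    var: str
    body: "Formula"


Formula = Union[Member, JointDenial, ForAll]


def show(f: Formula) -> str:
    """Print a formula in the fully parenthesized primitive notation."""
    if isinstance(f, Member):
        return f"({f.left} ε {f.right})"
    if isinstance(f, JointDenial):
        return f"({show(f.left)} ↓ {show(f.right)})"
    return f"({f.var}){show(f.body)}"


def free_vars(f: Formula) -> frozenset:
    """Variables having at least one free occurrence in f."""
    if isinstance(f, Member):
        return frozenset({f.left, f.right})
    if isinstance(f, JointDenial):
        return free_vars(f.left) | free_vars(f.right)
    return free_vars(f.body) - {f.var}


def is_statement(f: Formula) -> bool:
    """A logical statement is a logical formula without free variables."""
    return not free_vars(f)


def m(a: str, b: str) -> Member:
    return Member(a, b)


# The fourteen sample formulae, in the order of the list in the text.
EXAMPLES = [
    m("x", "x"), m("x", "y"), m("z", "w"),
    ForAll("x", m("x", "x")), ForAll("x", m("x", "y")),
    ForAll("y", m("x", "y")), ForAll("x", ForAll("y", m("x", "y"))),
    JointDenial(m("x", "x"), m("x", "x")), JointDenial(m("x", "x"), m("x", "y")),
    JointDenial(m("x", "y"), m("y", "x")), JointDenial(m("x", "y"), m("z", "w")),
    ForAll("x", JointDenial(m("x", "x"), m("x", "x"))),
    JointDenial(ForAll("x", m("x", "x")), ForAll("y", m("x", "y"))),
    ForAll("w", ForAll("x", JointDenial(
        JointDenial(m("x", "x"), ForAll("y", m("x", "y"))), m("x", "w")))),
]

# Positions (counting from 1) of the formulae that are statements.
CLOSED = [i + 1 for i, f in enumerate(EXAMPLES) if is_statement(f)]
```

The sketch decides only whether a formula is a statement; whether a statement is true or false depends on what classes there are, which no such syntactic check can settle.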

There are no atomic logical statements, for the atomic logical
formulae ⌜(α ε β)⌝ have free variables. The shortest possible
logical statements are those which are closures of the one-variable
matrices ⌜(α ε α)⌝; in other words, ‘(x)(x ε x)’ and its alphabetic
variants ‘(y)(y ε y)’, ‘(z)(z ε z)’, etc. These statements happen,
incidentally, to be false; for they say that every entity (and hence
every class) is a member of itself. Many classes are indeed members
of themselves, but many also are not; the class of cats, e.g.,
is not a cat.

The shortest logical statements which are true are:

(1) ((x)(x ε x) ↓ (x)(x ε x))

and its alphabetic variants, e.g.:

((z)(z ε z) ↓ (y)(y ε y)).

The truth (1), i.e. ‘~(x)(x ε x)’, denies that every entity is a
member of itself, though allowing that some may be. This
truth will not appear as a theorem until a considerably later point
(†240, § 32).

It will be noted that the logical formulae contain no names, not
even names of abstract entities, classes. In the atomic formulae
⌜(α ε β)⌝ the α and β are never names, but merely ‘x’, ‘y’, ‘z’, etc.;
in effect, pronouns (cf. § 12). The logical formulae do indeed comprise
statements as well as matrices; the statements have definite
import not by virtue of exhibiting names in place of variables, however,
but merely by virtue of containing enough quantifiers to
render the variables bound at all occurrences. All this is apparent
from the foregoing description and the examples. Subsequently, 
indeed, the logical formulae will be abbreviated through definitions 
in such a way that names of classes will take form within them; 
but such names will have the status merely of shorthand, capable 
always of elimination in favor of the frugal notation described 
above. 

We shall see in subsequent sections that the notions of identity,
relation, number, function, sum, product, power, limit, derivative,
etc., are all definable in terms of our three primitive notational
devices: membership, joint denial, and quantification with its
variables. Under those definitions every true and every false
statement which is couched in purely mathematical terms becomes
an abbreviation of a logical formula, in particular a logical statement,
in the sense described above. The three primitives thus
provide a complete mathematical language. The language is not,
indeed, a convenient one for mathematical practice; the simple
statement ‘1 + 1 = 2’, if translated into terms of these three
primitives, would run to a length of many pages. For mathematical
practice (demonstration and application of theorems)
definitional abbreviations are thus indispensable. For metamathematical
practice, on the other hand (formulation and
investigation of the general notions of formula, theorem, mathematical
truth, etc.), the reduction of concepts to a few primitives
is indispensable; the subject matter of our metamathematical
investigations is thereby simplified to the point of manageability.
(Cf. Introduction.) Reduction of the concepts of mathematics 
to the three primitives is of theoretical significance also as affording 
a measure of the net conceptual presuppositions involved in 
mathematical theory; and the definitions are significant as an 
analysis of the various derivative mathematical concepts. 

Frege was the first to show (1884) that the notions of arithmetic could be defined 
in purely logical terms. A more refined and detailed construction of arithmetic 
and derivative disciplines from logic was carried out by Whitehead and Russell. 
In the reduction of logic in turn to the three present primitives, one essential step 
was Russell’s discovery of how to define complex terms in context (cf. §26); a 
second was Sheffer’s reduction of conjunction, alternation, and denial to joint 
denial (§9); and a third was Wiener’s definition of the ordered pair (§36). The 
adequacy of substantially the three present primitives was mentioned by both
Tarski and Gödel in 1931.

The systems of Tarski and Gödel can in fact be translated, in turn, into terms of
still fewer primitives: just inclusion (§ 34) and abstraction (§ 24). See my “Inclusion
and Abstraction”; also “Theory of Types”. But the systematization of
logic developed in the present book turns out, curiously, to resist such translation.
The reason is that there are differences in detail between the notion of membership
developed here and the notion of membership which Tarski and Gödel took over
from Whitehead and Russell. The divergence is such that the equivalence of
⌜(α)φ⌝ with ⌜α̂φ = V⌝, on which the definition of quantification in terms of
inclusion and abstraction depends, fails in the present system (cf. § 32).

The notion of logical formula does not supersede the original 
relative notion of formula. Logical formulae are formulae of a 
restricted kind, inadequate to the expression of anything beyond 
the bounds of mathematics. For other subject matters we would 
need additional formulae, describable by stipulating new atomic 
formulae appropriate to that subject matter. The atomic formulae 
formed by flanking ‘ε’ with variables happen merely to be the ones
appropriate to logico-mathematical matters. A statement may be
a theorem under *100-*104 without being so limited in point of
atomic parts as to be classifiable as a logical formula, logical statement;
and all such statements are logically true, regardless of their
constituent vocabulary. The theorems which we shall have occasion
to consider in subsequent sections, however, will all be
logical statements, logical formulae.

Hitherto we were able to specify only the outward forms of 
formulae, leaving the content — the atomic formulae — unspecified; 
but now, so far as logical formulae are concerned, form and content 
are both at hand. We need no longer limit ourselves to setting 
down metatheorems, to the effect that all formulae of such and 
such forms are theorems; we can now set down actual theorems. 
From *103, e.g., we see that the closure:

(y)((x)(x ε y) ⊃. y ε y)

of ‘(x)(x ε y) ⊃. y ε y’ is a theorem; from *157, again, we see that:

(z)(y)((x)(y ε z . x ε y) ≡. y ε z . (x)(x ε y))

is a theorem.

One might indeed think of logic, in a narrow sense, as comprising 
just the theory of statement composition and quantification, and 
hence as dealing with just the outward form of formulae through 
the medium of metatheorems. The level upon which we are now 
entering, then, might be regarded as forming a second level of 
mathematics — the theory of classes, otherwise known as the 
theory of sets, the theory of aggregates, Mengenlehre. At this 
level for the first time actual formulae are forthcoming, as theorems 
about a mathematical subject matter. Relation theory, arithmetic,
and other branches of mathematics then fall into place as
specializations or subdivisions of the theory of classes. From this 
point of view, incidentally, the so-called “logical formulae” would 
be more appropriately describable as “mathematical formulae”; 
the former designation will be retained, however, and the narrow 
sense of ‘logic’ just now suggested will not be urged. 


§ 24. Abstraction

THE USUAL way of specifying a class is by citing a necessary and 
sufficient condition for membership in it. Such is the method when 
one speaks of “the class of all entities x such that . . . , ” appending 
one or another matrix. The class of all entities x such that x writes 
poems, e.g., is the class of poets; the class of all entities x such 
that 

(∃y)(y ε integer . x = y²),

again, is the class of square numbers. Despite its sanction from 
the side of usage and common sense, however, this method of 
specifying classes leads to trouble. Applied to certain matrices,
the prefix ‘the class of all entities x such that’ produces expressions
which cannot consistently be regarded as designating any class
whatever. One matrix of this kind, discovered by Russell, 1 is
‘~(x ε x)’; there is no such thing as the class of all entities x such
that ~(x ε x). For, suppose w were such a class. For every entity
x, then,

x ε w .≡ ~(x ε x).

Taking x in particular as w itself, we are led to the contradiction:

w ε w .≡ ~(w ε w).

1 Principles, Ch. X. See also Frege, Grundgesetze, vol. 2, pp. 253 ff.

The matrix ‘~(x ε x)’ is the first and simplest of an infinite series
of matrices, viz.:

~(x ε x), (y)~(x ε y . y ε x), (y)(z)~(x ε y . y ε z . z ε x),

..., all of which share the same peculiarity. In the case of each
of these matrices, as seen in the case of ‘~(x ε x)’, the assumption
of a “class of all entities x such that ...” leads to contradiction.
There is no class w such that 

(x)(x ε w .≡ ~(x ε x)),

nor any such that

(x)(x ε w .≡ (y)~(x ε y . y ε x)),

nor any such that

(x)(x ε w .≡ (y)(z)~(x ε y . y ε z . z ε x)),

and so on. This is apparent from the metatheorem *181, below,
according to which the statements:

(w)~(x)(x ε w .≡ ~(x ε x)),

(w)~(x)(x ε w .≡ (y)~(x ε y . y ε x)),

(w)~(x)(x ε w .≡ (y)(z)~(x ε y . y ε z . z ε x)),

etc. are all theorems. 
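
The impossibility of such classes can be illustrated, though of course not proved, by exhaustive search over small finite membership structures. The sketch below is an assumption-laden toy: the function names are invented here, and the search covers only universes of up to three entities. With `k = 0` the condition tested is Russell's ‘~(x ε x)’; with `k = 1` and `k = 2` it is the second and third matrix of the series.

```python
from itertools import product


def has_cycle_through(member, domain, x, k):
    """True if x ε y1 . y1 ε y2 . ... . yk ε x for some y1, ..., yk in the
    domain (for k = 0 this is simply x ε x)."""
    for ys in product(domain, repeat=k):
        chain = (x, *ys, x)
        if all(member[(chain[i], chain[i + 1])] for i in range(len(chain) - 1)):
            return True
    return False


def paradoxical_class_exists(n, k):
    """Search every membership relation on n entities for a w such that,
    for all x, x ε w if and only if x lies on no membership cycle of the
    above kind."""
    domain = range(n)
    pairs = [(a, b) for a in domain for b in domain]
    for bits in product([False, True], repeat=len(pairs)):
        member = dict(zip(pairs, bits))       # member[(x, y)] means x ε y
        for w in domain:
            if all(member[(x, w)] == (not has_cycle_through(member, domain, x, k))
                   for x in domain):
                return True
    return False


# Survey of small cases: universe sizes 1-3, cycle parameters 0-2.
FOUND = [(n, k) for n in (1, 2, 3) for k in (0, 1, 2)
         if paradoxical_class_exists(n, k)]
```

The search finds no such w in any of these cases, which agrees with what *181 asserts in general; the finite check adds nothing to the metatheorem beyond a concrete picture of it.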

It is convenient for purposes of future reference to prove a 
somewhat more complicated metatheorem *180 first and then 
derive *181 from it. In the proof of *180 a new sort of abbreviation
is used, involving the letters ‘L’ and ‘R’. The notation
‘L180’ stands as an abbreviation for the matter:

(α)(α ε β .≡. φ . (γ₁) ... (γₙ)~(α ε γ₁ . γ₁ ε γ₂ . ... . γₙ ε α))

which appears to the left of the main connective in the expression
for which ‘180’ stands. Similarly ‘R4’ stands as an abbreviation
for the matter:

~(β ε γ₂ . ... . γₙ ε α . α ε β)

which appears to the right of the main connective in the expression
for which ‘4’ stands. ‘L1’ and ‘R6’ are analogous.

*180. If α, β, γ₁, ..., γₙ are distinct, and φ′ is like φ except for
containing free occurrences of β wherever φ contains free occurrences
of α, then

⊢ ⌜(α)(α ε β .≡. φ . (γ₁) ... (γₙ)~(α ε γ₁ . γ₁ ε γ₂ . ... . γₙ ε α))
        ⊃ ~φ′⌝.

Proof. Case 1: n = 0. By *103 (& hp 2),

        ⊢ ⌜(α)(α ε β .≡. φ . ~(α ε α)) ⊃: β ε β .≡. φ′ . ~(β ε β)
*100            ⊃ ~φ′⌝.

Case 2: n > 0.

*114 (- r ( 7 i) . . . (y„) ~ (ct « 7i.7»£«) 

3 ~ (a e 71 . 7 „ e a)" 1 . ( 1 ) 

*103 (& hp 1 ) [- r (^( 7 i) 1 3.]] Ll 3 ~ (a e P . P e 72 . 7 n e a)" 1 . (2) 

*100 (- r <t>. Ll .3 LU. (3) 

*110 [- r O)(<* € 0 .3 Ll)3 laepDLV 

*100 3. [2 33 ~'(j3 e 72 . 7 ,ea.af / 8 )" 1 . (4) 

*121 \- r Ll80 3. («)(« <p .3 Ll) [s (a)3l 

(4), *117 (&hp 1) 3 ( 72 ) • • • (y„)(«) R4 1 . (5) 

*171 (& hp 1) 

H T 5 — ■] L180 3 ( 71 ) . . . ( 7 „) ~ (P e 71.7i 6 72 .7«e 0)" 1 - ( 6 ) 

*103 (& hp 1) r R 6 3 ( 72 ) . . . ( 7 n) ~ (/3 e P . P e 72 . 7 n < (3 )" 1 

*103 (& hp 1) 3 ( 73 ) . . . ( 7 „) ~ (p e 0 . /3 £ 0. 7 * e 0) n 


*103 3 ~(0«0.0e0. (8 e 0 )" 1 


*100 3~(0 e 0)' 1 . (7) 

*103 (& hp) |- r L180 3: 0«0 .=. 0 '. R 6 n 
*100 3:[7 3.]R6 3 ( 8 ) 

*100 h r [8 3.6 3]180L 

*181. If α, β, γ₁, ..., γₙ are distinct,

⊢ ⌜~(α)(α ε β .≡ (γ₁) ... (γₙ)~(α ε γ₁ . γ₁ ε γ₂ . ... . γₙ ε α))⌝.

Proof. *100          ⊢ ⌜α ε α .⊃. α ε α⌝.   (1)

*180 (& hp)   ⊢ ⌜(α)(α ε β .≡. [1 .] (γ₁) ... (γₙ)~(α ε γ₁ . ... . γₙ ε α))
                  ⊃ ~(β ε β .⊃. β ε β)⌝.   (2)

*100          ⊢ ⌜[2 ⊃] 181⌝.

We are thus forced to recognize that the idiom ‘the class of all 
entities x such that. . .’ cannot in general be trusted to express a 
class, common sense to the contrary notwithstanding. We must 
restrict this idiom in one way or another. There are a number of 
ways; we may revise the phrase ‘all entities x’, or we may restrict 
the matrix which follows ‘such that’, or we may do both. The 
first of these courses, adopted by von Neumann, 1 seems to recommend
itself above the others. Under this procedure the realm of
entities which can be formed into classes is narrowed somewhat,
by deciding that certain classes are incapable of being members of
classes. The idiom ‘the class of all entities x such that’ is then
revised to read ‘the class of all membership-eligible entities x such
that ...’.

1 “Eine Axiomatisierung.” For historical remarks see below, § 29.

For brevity let us refer to “membership-eligible” entities hereafter
as elements. An element is thus to be understood simply as
the sort of entity that belongs to classes; ‘x is an element’ can be
rendered symbolically ‘(∃y)(x ε y)’. Observe now how our revised
idiom, ‘the class of all elements x such that ...’, fares in connection
with Russell’s matrix ‘~(x ε x)’. Where w is the class of all
elements x such that ~(x ε x), we are not called upon to equate
‘x ε w’ with ‘~(x ε x)’. Rather, ‘x ε w’ means that x is an element
and ~(x ε x); symbolically,

(∃y)(x ε y) . ~(x ε x).

Instead of having ‘(x)(x ε w .≡ ~(x ε x))’, and the self-contradictory
consequence ‘w ε w .≡ ~(w ε w)’, we have merely:

(x)(x ε w .≡. (∃y)(x ε y) . ~(x ε x)),

whose consequence:

w ε w .≡. (∃y)(w ε y) . ~(w ε w),

far from being self-contradictory, is equivalent to:

~(∃y)(w ε y)

and thus tells us merely that w itself is not an element.
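
The equivalence just claimed can be checked by a small truth table. This is a sketch only; the two Boolean names are assumptions standing for ‘w ε w’ and ‘(∃y)(w ε y)’. The one fact used beyond truth-functional logic is that ‘w ε w’ implies ‘(∃y)(w ε y)’, since w would then itself be a class having w as member.

```python
from itertools import product


def consequence_equivalent_to_non_elementhood() -> bool:
    """Compare 'w ε w .≡. (∃y)(w ε y) . ~(w ε w)' with '~(∃y)(w ε y)'
    under every admissible assignment of truth values."""
    for w_in_w, w_is_element in product([False, True], repeat=2):
        if w_in_w and not w_is_element:
            # Inadmissible: w ε w already makes w a member of some class.
            continue
        consequence = (w_in_w == (w_is_element and not w_in_w))
        if consequence != (not w_is_element):
            return False
    return True


RESULT = consequence_equivalent_to_non_elementhood()
```

Without the excluded row the two formulae would differ (the consequence also rules out ‘w ε w’), so the comparison genuinely depends on the remark that self-membership entails elementhood.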

The status of element must be withheld not only from the class 
w of all elements x such that ~(x ε x), but also from the class of all
elements x such that (y)~(x ε y . y ε x), and from the class of all
elements x such that (y)(z)~(x ε y . y ε z . z ε x), and so on.
The theorems to this effect, viz. the closures of:

(x)(x ε w .≡. (∃y)(x ε y) . ~(x ε x)) ⊃ ~(∃y)(w ε y),

(x)(x ε w .≡. (∃y)(x ε y) . (y)~(x ε y . y ε x)) ⊃ ~(∃y)(w ε y),

(x)(x ε w .≡. (∃y)(x ε y) . (y)(z)~(x ε y . y ε z . z ε x)) ⊃ ~(∃y)(w ε y),

etc., are all provided by *180 (φ and φ′ being taken as ⌜(∃δ)(α ε δ)⌝
and ⌜(∃δ)(β ε δ)⌝). The question of determining what
entities are to be rated as elements will be dealt with later (§ 28);
for the present, suffice it to say that the only entities which will 
not be elements are certain queer classes such as the ones just now 
noted. 

The prefix ‘the class of all elements x such that’ will be rendered
symbolically ‘x̂’; thus the class w discussed above is x̂ ~(x ε x). 1
The forming of class names by such prefixes will be called abstraction;
and the result, e.g. ‘x̂ ~(x ε x)’ or in general ⌜α̂φ⌝,
will be called an abstract. Seeking now to define the notation
⌜α̂φ⌝ on the basis of our primitive notation, we encounter the
following difficulty. Our primitive notation comprises machinery
for the construction solely of statements and statement matrices
(atomic matrices ⌜(α ε β)⌝, quantifications ⌜(α)φ⌝, joint denials
⌜(φ ↓ ψ)⌝); and an abstract ⌜α̂φ⌝ cannot be construed as an abbreviation
of any expression of this sort, since it is supposed to behave
rather as a noun. The way out of this difficulty is suggested,
however, by reflecting on fragmentary expressions such as ‘≡’,
‘⊃’, ⌜⊃ ψ⌝. By no means identifiable with statements or statement
matrices, these expressions have none the less been introduced
by definitions; they have been defined in context. They are
fragments of expressions ⌜(φ ≡ ψ)⌝ and ⌜(φ ⊃ ψ)⌝ which have
been defined entire as abbreviations of appropriate formulae. Now
the same sort of contextual definition is available for abstracts.
Turning our attention to such contexts of ⌜α̂φ⌝ as are appropriately
identifiable with formulae, we can frame suitable definitions of these
contexts as wholes. By adopting such definitions in sufficient
variety, we can account for the occurrence of ⌜α̂φ⌝ in statement
contexts of every desired kind.

1 This notation was adapted from Frege by Russell. But my reference to
elementhood, in interpreting the notation, is a departure. Cf. § 29.

One definition is wanted, e.g., explaining contexts of the form
⌜β ε α̂φ⌝: contexts affirming membership in the class which ⌜α̂φ⌝
describes. Other definitions are wanted explaining other contexts;
but let us postpone them (to § 26) and deal with this first case.
We want to define ⌜β ε α̂φ⌝ in such way that, where ‘... x ...’ is
thought of as any formula involving ‘x’, the whole:

(1) y ε x̂(... x ...)

will agree in meaning with the words:

(2) y is a member of the class of all elements x such that ... x ....

This requirement proves to be met by the definition:

D9. ⌜(β ε α̂φ)⌝ for ⌜(∃γ)(β ε γ . (α)(α ε γ .⊃ φ))⌝

(γ being any new variable). D9 explains (1) as an abbreviation of:

(3) (∃z)(y ε z . (x)(x ε z .⊃. ... x ...)),

whose equivalence with (2) is easily seen. Where z is the class of all
elements x such that ... x ..., clearly

(x)(x ε z .⊃. ... x ...);

hence (3) is true when (2) is. Conversely, since (3) says that y is a
member of a class all of whose members are entities x such that
... x ..., we can conclude from (3) both that y is an element (being
a member of a class) and that y is one of the entities x such
that ... x ...; hence (2) is true when (3) is.

D9 has the peculiarity of imposing like abbreviations upon unlike
formulae; e.g., the formulae:

(∃z)(y ε z . (x)(x ε z .⊃. x ε x)),

(∃w)(y ε w . (x)(x ε w .⊃. x ε x))

both become ‘y ε x̂(x ε x)’. But such formulae will always be
alphabetic variants of one another, and hence interchangeable in
view of *171 (and *123). Shift from one choice of γ to another, in
expanding ⌜β ε α̂φ⌝ by D9, amounts merely to relettering the
expanded form by *171, so long as γ is kept distinct from α, β,
and the variables of φ.

The formulae and in particular the axioms of quantification and 
the theorems are, properly speaking, composed of the primitive 
notation without definitional abbreviations (cf. § 13). When a 
definitionally abbreviated expression is said to be a theorem, what 
is properly meant is rather that it is an abbreviation of a theorem. 
In all discourse about theorems, defined notations are imagined 
expanded into primitive notation. Hence caution is needed in 
adopting definitions which impose like abbreviations upon primitively
unlike formulae. If a definition of that kind happened
to cause some theorem ψ and some non-theorem χ to be abbreviated
alike, fallacy would result; by merely abbreviating ψ
according to the definition, and then expanding the result according
to the definition, one would reason from the fact that ψ is a
theorem to the false conclusion that χ is a theorem. However,
*171 has shown that D9 (with γ understood as distinct from α, β,
and the variables of φ) is not capable of thus blurring the distinction
between theorems and non-theorems. Adoption of such a definition
as D9 amounts to tacit use of *171.

Definitions which impose like abbreviations upon primitively 
unlike formulae will never obtrude themselves hereafter except in 
cases similar to D9: cases where a bound variable appears in the 
unabbreviated rendering and disappears in the abbreviation. But 
definitions of just this sort will turn up so frequently that it is 
simplest not to stop each time for an arbitrary alphabetical stipulation
of the odd variable; rather it will merely be understood,
as in the case of D9, that the variable is not to be so chosen as to 
conflict with the others at hand. The puristic reader can still, if 
he prefers, reconstrue all such definitions arbitrarily as calling for 
the alphabetically earliest variable other than those at hand; for 
he can always rewrite his variable afterward at will by *171. 
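
The puristic convention can be made concrete. The sketch below expands a context of the form covered by D9 into primitive notation, choosing for the new bound variable the alphabetically earliest letter not already at hand. It is an illustration under stated assumptions: formulae are handled as plain strings, variables are taken to be single lower-case Latin letters, and the function names `fresh_variable` and `expand_d9` are invented here.

```python
import re


def fresh_variable(taken):
    """Alphabetically earliest single-letter variable not in `taken`."""
    for letter in "abcdefghijklmnopqrstuvwxyz":
        if letter not in taken:
            return letter
    raise ValueError("no single-letter variable left")


def expand_d9(beta: str, alpha: str, phi: str) -> str:
    """Expand 'beta ε alpha-hat phi' by D9 into
    (∃γ)(β ε γ . (α)(α ε γ .⊃. φ)), with γ distinct from α, β,
    and every variable occurring in φ."""
    # Assumption: every lower-case Latin letter in phi is a variable.
    taken = {beta, alpha} | set(re.findall(r"[a-z]", phi))
    gamma = fresh_variable(taken)
    return f"(∃{gamma})({beta} ε {gamma} . ({alpha})({alpha} ε {gamma} .⊃. {phi}))"


EXPANDED = expand_d9("y", "x", "x ε x")
```

Here `EXPANDED` is an alphabetic variant of the two expansions with ‘z’ and ‘w’ displayed earlier in this section; by *171 the choice among them makes no difference to theoremhood.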


§ 25. Identity


WE TURN now to the problem of so defining ‘x = y’, in terms of
‘ε’ and our other primitives, that it will carry the intended sense
‘x and y are the same object’. In the trivial case where y is not a
class, indeed, x ε y if and only if x = y in this sense (cf. § 22); but
our problem remains, since ‘x ε y’ diverges in meaning from
‘x = y’ in case y is a class. We must find a formula, composed of
‘x’ and ‘y’ by means of ‘ε’ and our other primitives, which will be
true just in case x and y are the same object, whether a class or
a non-class.

The requirement is met by:

(1) (z)(z ε x .≡. z ε y)

when x and y are classes, since classes are the same when their
members are the same (cf. § 22). Moreover, (1) continues to meet
the requirement when x and y are not classes. For, in this case
‘z ε x’ and ‘z ε y’ identify z with x and with y; and (1) as a whole
then says that whatever is the same as x is the same as y, thus
identifying x and y. Both where x and y are classes and where they
are not, therefore, (1) meets our requirements; (1) is true if and
only if x and y are the same. We are thus led to introduce ‘x = y’
as an abbreviation of (1).

This definition of ‘ = ’ has one rather curious but harmless result: 
where x is not a class, we do not have over and above x any class 
whose sole member is x. What would appear to be the class whose 
sole member is x reduces to the non-class x itself. For, suppose x 
is a non-class and y is the class whose sole member is x. Then z ε y
if and only if z is x. But also, according to the interpretation of
‘z ε x’ adopted for the case where x is not a class, z ε x if and only if
z is x. Hence (1) is true; i.e., according to our definition, x = y.
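
Both points, the adequacy of (1) and the collapse of a non-class's would-be unit class, can be pictured in a toy universe. The sketch is illustrative only: the five entities, the dictionary `MEMBERS`, and the function names are assumptions made here, with each non-class listed as its own sole member in keeping with the interpretation of ‘z ε x’ just recalled.

```python
# Hypothetical toy universe: each name is mapped to the set of its members.
MEMBERS = {
    "a": {"a"},        # a non-class: 'z ε a' holds just when z is a
    "b": {"b"},        # another non-class
    "s": {"a", "b"},   # the class of a and b
    "t": {"s"},        # the class whose sole member is s
    "n": set(),        # a class with no members
}


def identical(x: str, y: str, members=MEMBERS) -> bool:
    """Formula (1): (z)(z ε x .≡. z ε y), z ranging over the toy universe."""
    return all((z in members[x]) == (z in members[y]) for z in members)


def agrees_with_sameness(members=MEMBERS) -> bool:
    """Does (1) hold of exactly those pairs that are one and the same entity?"""
    return all(identical(x, y, members) == (x == y)
               for x in members for y in members)


# Add a would-be unit class u of the non-class a.
WITH_UNIT_CLASS = dict(MEMBERS, u={"a"})
```

In the original universe `identical` coincides with sameness of entity. Once `u` is added, `identical("a", "u", WITH_UNIT_CLASS)` holds, so under the definition `u` is not a further entity over and above `a`; the class `t`, by contrast, remains distinct from its sole member `s`.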

This result (suppression of the class of a single non-class in
favor of the non-class itself) may just as well be conceived in
reverse fashion as an assimilation of non-classes to classes. What
was regarded as a non-class becomes reconstrued as a special sort
of class, viz. a class having itself as sole member. If we think of the
matter in this way, we must of course abandon the term ‘non-class’
in favor of a more neutral designation, say ‘individual’.
Everything comes now to be thought of as a class, but individuals
are distinguished from other classes by the peculiar circumstance
of being their own sole members. This way of phrasing the situation
is more convenient than the other, for it enables us to give
‘x ε y’ always the uniform reading ‘x is a member of the class y’.
Note, in any case, that we are far from identifying every object x 
with the class whose sole member is x; this happens only if x is an 
individual. From a formal standpoint, actually, there is no need 
of assuming that there is any such x at all (cf. § 22). 

Variables and abstracts will be spoken of collectively as terms.
Now let us supplement our Greek-letter conventions to this extent:
just as we use ‘φ’, ‘ψ’, and ‘χ’ to refer to any formulae, and ‘α’,
‘β’, ‘γ’, and ‘δ’ to refer to any variables, so let us use ‘ζ’, ‘η’, and
‘θ’ (along with their accented and subscripted variants) to refer
in general to any terms. With help of this convention we can express
the general definition of identity as follows, for application to
variables and abstracts indifferently:

D10. ⌜(ζ = η)⌝ for ⌜(α)(α ε ζ .≡. α ε η)⌝.

The notation ⌜α ε ζ⌝ here is either primitive or explained by D9,
according as ζ is a variable or an abstract; and similarly for
⌜α ε η⌝. The choice of α follows the usual understanding (§ 24);
it may be any variable other than the ones in ζ and η. The parentheses
in ⌜(ζ = η)⌝ record the fact that ‘=’, like all binary
connectives, is to be construed as carrying with it a pair of parentheses. 1

It is more common to define ⌜ζ = η⌝ as ⌜(α)(ζ ε α .≡. η ε α)⌝ or ⌜(α)(ζ ε α .⊃.
η ε α)⌝. Both of these versions are given by Peirce (3.398, 400) and one or the
other is rather hinted by Leibniz (Lewis, Survey, p. 373); but they cease to be
available once non-elements are recognized (cf. § 32). D10, as a definition of class
identity, was stated in effect by Peirce (3.47) and in its present form by Fraenkel 
(Einleitung, 3d ed., p. 272). Its availability as a general definition of identity 
turns on the special way in which I have dealt with individuals. 

My treatment of individuals is reminiscent of the days before Frege and Peano 
urged the importance of distinguishing between an object and its unit class, i.e.
the class which has that object as sole member (cf. § 35). Commonly the distinction
in question is indeed vital. We must distinguish between the null class
(§ 26) and its unit class, for the former has no members while the latter has one; 
and we must distinguish between the class of Apostles and its unit class, for Peter 
belongs to the one class and not to the other. But the arguments which show that 
the distinction must commonly be made do not militate against the identification, 
in particular, of individuals with their unit classes (and hence with the unit classes 
of their unit classes and so on). Within the domain of individuals, retention of the 
pre-Fregean attitude leads to no trouble. 

Though I have been led to this course by considerations only of technical efficacy,
a certain naturalness may be argued for it on the basis of such illustrations
as the following. Let us think of points as individuals, and of lines and planes as
classes of points. Then the intersection or logical product (§ 33) of two planes is a
line; but, if we distinguish individuals from their unit classes, the intersection of
two lines is only the unit class of a point and not a point. Again, whereas a line
in a given plane is related to the plane by inclusion (§ 34), a point in a given line
is not related to the line by inclusion if we distinguish individuals from their unit
classes; the point is rather a member of the line, while the unit class of the point
is included in the line. Under the proposed theory, on the other hand, intersections
of lines are points, and points bear to lines the same inclusion relation which lines
bear to planes. (This illustration is due to Miss L. D. Steinhardt.)

1 In the matter of parentheses, ‘=’ is like ‘ε’; the parentheses are adopted with
an eye to later definitions. See p. 124, footnote.

The usual compact way of denying identity is adopted in the 
following definition. 

D11. ⌜(ζ ≠ η)⌝ for ⌜~(ζ = η)⌝.

We proceed to three theorems.

†182. (x) x = x                              Proof: *100 (& D10).

†183. (y)(x) x = y .≡. y = x                 Proof: *100, *123 (& D10).

†184. (z)(y)(x) x = y .⊃: x = z .≡. y = z    Proof: *121 (& D10).

In stating theorems it is convenient to leave a space between the 
initial group of universal quantifiers and the matrix which they 
govern, on the understanding that those quantifiers are to cover 
the matrix in its entirety. The theorem is thus the closure of this 
segregated matrix; †182-†184 are:

(x)(x = x),

(y)(x)(x = y .≡. y = x),

(z)(y)(x)(x = y .⊃: x = z .≡. y = z).

The fact that †182-†184 are theorems, rather than metatheorems,
is given recognition by attaching a dagger ‘†’ rather than a
star to the numerals. Apart from the distinguishing stars and
daggers, the numerical designations of metatheorems and theorems
run along together in a single increasing sequence; first we have
metatheorems (*100-*181), then some theorems (†182-†185),
then more metatheorems (*186-*188), and so on.

The proofs of †182-†184, indicated above, call for little comment.
†182 is an abbreviation, by D10, of:

(x)(w)(w ε x .≡. w ε x),

which is the closure of a tautologous matrix ⌜φ ≡ φ⌝; and *100
says that closures of such matrices are theorems. †183 is an
abbreviation, by D10, of:

(y)(x)((w)(w ε x .≡. w ε y) ≡ (w)(w ε y .≡. w ε x)),

which is the closure of a matrix ⌜(α)φ ≡ (α)ψ⌝ such that ⌜φ ≡ ψ⌝
is tautologous; and *100 and *123 say that closures of such
matrices are theorems. †184 is an abbreviation, by D10, of:

(z)(y)(x)((w)(w ε x .≡. w ε y) ⊃. (w)(w ε x .≡. w ε z) ≡ (w)(w ε y .≡. w ε z)),

which is the closure of a matrix ⌜(α)(φ ≡ φ′) ⊃. ψ ≡ ψ′⌝ of the kind
described in *121.

†182 and †183 are known respectively as the principles of the
reflexivity and commutativity (or symmetry) of identity, by a
natural extension of the notions of reflexivity and commutativity
which were applied to statement composition (§ 11). 1 The following,
by a similar extension, is called the principle of the transitivity
of identity:

(z)(y)(x) x = y . y = z .⊃. x = z.

†184 is a stronger form of this principle.

The proof of the next theorem is more elaborate. It illustrates 
the notation which will be used hereafter in proofs of theorems, as 
distinct from metatheorems. 

†185. (y)(x) y ε x .≡ (∃z)(y ε z . z = x)

Proof. *110 (& D10)    z = x .⊃: y ε z .≡. y ε x                  (1)

*100, *163             (∃z)([1 .] y ε z . z = x) ⊃. y ε x         (2)

*134                   y ε x [. 182] .⊃ (∃z)(y ε z . z = x)       (3)

*100                   [2 . 3 .⊃] 185

The expressions which occupy the several lines in the proof of a
theorem are not in general theorems themselves but rather formulae
whose closures are theorems. The references at the left of the line
substantiate the fact that the closure of the exhibited formula is a
theorem. Thus, in the first line of the above proof we cite *110
(and D10) to show that

⊢ ‘z = x .⊃: y ε z .≡. y ε x’,

i.e., to show that the closure:

(z)(y)(x) z = x .⊃: y ε z .≡. y ε x

is a theorem. The numeral at the right of a line serves, in its
unparenthesized recurrences, as an abbreviation of the formula
appearing in that line (minus any bracketed matter); thus ‘1’ in
the second line of the proof is an abbreviation of:

z = x .⊃: y ε z .≡. y ε x,

and ‘2’ in the last line is an abbreviation of:

(∃z)(y ε z . z = x) ⊃. y ε x.

Brackets indicate use of *124, as hitherto; having shown e.g. that
the closure of the matrix which ‘1’ abbreviates is a theorem, we
bracket ‘1 .’ out of the second line of the proof on the strength of
*124. Divorced of its dagger, the numeral of a theorem serves as
an abbreviation of the theorem minus such initial quantifiers as are
spaced off to the left; thus ‘182’ and ‘185’ are abbreviations of
the respective matrices:

x = x,    y ε x .≡ (∃z)(y ε z . z = x),

whose closures are the theorems †182 and †185.

1 The above instance suggests this simple formulation of reflexivity as applied
to predicates: a reflexive predicate is one whose insertion in the blank of ‘(x)(x   x)’
yields a truth. Such reflexivity may be called total reflexivity (cf. Carnap, Abriss, p.
40); for the term ‘reflexive’ is also commonly applied, more broadly, to any predicate
whose insertion in the three blanks of:

(x)(y)(x   y .∨. y   x :⊃. x   x)

yields a truth. Note that the notion of reflexivity as applied to statement composition
(§ 11) can be broadened in analogous fashion; conjunction then comes to
be classed as reflexive, though not totally reflexive.

The combination of *100 and *163, as in the second line of the 
above proof, is one which will appear frequently hereafter. In 
such a step of proof we observe first that the whole line minus the 
existential quantifier, in this case: 

1 . y ε z . z = x .⊃. y ε x,

is tautologous; then we introduce the quantifier on the strength of
*163, noting that the variable of quantification is not free in the
consequent. The number of quantifiers thus introduced need not
be limited to one; in general, *100 and *163 show that

⊢ ⌜(∃α₁) ... (∃αₙ)φ ⊃ ψ⌝

where ⌜φ ⊃ ψ⌝ is tautologous and α₁, ..., αₙ are not free in ψ.




Combination of *100 with *117 will also be common; in such cases 
the added quantifiers are universal rather than existential, and 
apply to the consequent rather than the antecedent. 
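
The tautology check that underlies a *100, *163 step is mechanical, and can be sketched in a few lines of Python. The sketch is an illustration under stated assumptions and no part of the formal system: the three atomic formulae of the example are treated as unanalyzed truth-functional components, and the letters p, q, r and all function names are introduced here for the purpose.

```python
from itertools import product


def is_tautology(formula, atoms):
    """True if formula(assignment) holds under every truth assignment."""
    return all(
        formula(dict(zip(atoms, values)))
        for values in product([True, False], repeat=len(atoms))
    )


def implies(a, b):
    return (not a) or b


# Hypothetical lettering of the atomic formulae in the proof of †185:
#   p : z = x      q : y ε z      r : y ε x
ATOMS = ["p", "q", "r"]


def line_1(v):
    # Line (1):  z = x .⊃: y ε z .≡. y ε x
    return implies(v["p"], v["q"] == v["r"])


def line_2_matrix(v):
    # Line (2) minus its existential quantifier:
    #   1 . y ε z . z = x .⊃. y ε x
    return implies(line_1(v) and v["q"] and v["p"], v["r"])


def matrix_without_line_1(v):
    # The same matrix with the bracketed conjunct '1' left out
    return implies(v["q"] and v["p"], v["r"])
```

With the conjunct ‘1’ present the matrix passes the test; with it left out the matrix fails, which is why line (1) has to be established before it can be bracketed out.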


§ 26 . Abstraction Resumed 

LET US now return to the task of defining abstracts ⌜α̂φ⌝ in context.
D9 provides for all contexts in which the abstract follows an
occurrence of ‘ε’ which is preceded by a variable. But now we
must provide for contexts in which the abstract precedes an
occurrence of ‘ε’ which is followed by a variable; and also we must
provide for contexts in which abstracts stand on both sides of ‘ε’.
These two cases can be lumped together: our problem is to define
expressions of the form ⌜α̂φ ε ζ⌝, where ζ is a variable or an
abstract. Now that D10 is at hand, explaining ‘=’ in application to
variables and abstracts alike, our problem is easily solved:

D12. ⌜(α̂φ ε ζ)⌝ for ⌜(∃β)(β = α̂φ . β ε ζ)⌝.

The notation ⌜β ε ζ⌝ here is either primitive or explained by D9,
according as ζ is a variable or an abstract. The choice of β follows
the usual understanding; it may be any variable other than α and
the ones in φ and ζ.

The notation ⌜ζ ε η⌝ is now generally accounted for, no matter
what terms ζ and η may be. If ζ and η are variables, ⌜ζ ε η⌝ is
primitive; if ζ is a variable and η an abstract, ⌜ζ ε η⌝ is explained by
D9; and if ζ is an abstract, ⌜ζ ε η⌝ is explained by D12. Two minor
conventions of abbreviation will now be added.

D13. ⌜(ζ ε̃ η)⌝ for ⌜~(ζ ε η)⌝.

D14. ⌜(ζ₁, ζ₂, ..., ζₙ ε η)⌝ for ⌜(ζ₁ ε η . ζ₂ ε η . ... . ζₙ ε η)⌝.
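
The way D9, D10, and D12 between them account for every membership context can be traced by a small rewriting routine. The following Python sketch is offered only as an illustration. Its assumptions: formulae are nested tuples, a variable is a string, an abstract is written ("abs", variable, formula), the truth-functional connectives are left unreduced, and the extra variables v1, v2, ... are supposed not to occur in the formula being expanded. None of these conventions comes from the text.

```python
import itertools


def fresh_names():
    """Yield v1, v2, ...; assumed not to occur in the input formula."""
    return (f"v{i}" for i in itertools.count(1))


def is_var(term):
    return isinstance(term, str)


def expand(f, names):
    """Rewrite f so that 'in' joins variables only and '=' is gone."""
    op = f[0]
    if op == "in":
        _, left, right = f
        if is_var(left) and is_var(right):
            return f                                  # primitive
        if is_var(left):
            # D9: β ε α̂φ  for  (∃γ)(β ε γ . (α)(α ε γ .⊃ φ))
            _, alpha, phi = right
            gamma = next(names)
            return ("ex", gamma,
                    ("and", ("in", left, gamma),
                     ("all", alpha,
                      ("imp", ("in", alpha, gamma), expand(phi, names)))))
        # D12: α̂φ ε ζ  for  (∃β)(β = α̂φ . β ε ζ)
        beta = next(names)
        return ("ex", beta,
                ("and", expand(("eq", beta, left), names),
                 expand(("in", beta, right), names)))
    if op == "eq":
        # D10: ζ = η  for  (α)(α ε ζ .≡. α ε η)
        _, left, right = f
        alpha = next(names)
        return ("all", alpha,
                ("iff", expand(("in", alpha, left), names),
                 expand(("in", alpha, right), names)))
    if op == "not":
        return ("not", expand(f[1], names))
    if op in ("and", "imp", "iff"):
        return (op, expand(f[1], names), expand(f[2], names))
    if op in ("all", "ex"):
        return (op, f[1], expand(f[2], names))
    raise ValueError(f"unknown formula: {f!r}")


def is_primitive(f):
    """True if f contains no abstracts and no identity signs."""
    if isinstance(f, str):
        return True
    if f[0] in ("abs", "eq"):
        return False
    return all(is_primitive(part) for part in f[1:])


# x̂ ~(x ε x), used below as a sample abstract
SAMPLE = ("abs", "x", ("not", ("in", "x", "x")))
```

Applied to ("in", "y", SAMPLE) the routine takes the D9 branch once; applied to ("in", SAMPLE, SAMPLE) it passes through D12, D10, and D9 in turn, and in either case no abstract survives.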

Just as ⌜α ε β⌝ and ⌜β ε α⌝ are atomic contexts of α, so we may
speak of ⌜α̂φ ε η⌝ and ⌜η ε α̂φ⌝ as quasi-atomic contexts of ⌜α̂φ⌝.
More generally, a quasi-atomic context of a term ζ is any result of
putting ζ for one variable and perhaps other terms for other
variables in an atomic matrix. Now D9 and D12 explain all those
quasi-atomic contexts of ⌜α̂φ⌝ which correspond to logical atomic
matrices ⌜α ε β⌝; for D9 explains ⌜γ ε α̂φ⌝ and D12 explains
⌜α̂φ ε η⌝ and ⌜γ̂ψ ε α̂φ⌝. But extra-mathematical discourse would
involve atomic matrices of other forms than ⌜α ε β⌝, and would
thus call for definition of ⌜α̂φ⌝ in supplementary quasi-atomic
contexts. All such definitions can follow the pattern of D12;
corresponding to an atomic matrix ⌜μ β₁ β₂ ... βₙ⌝, in general, we
can introduce quasi-atomic contexts by a sequence of n definitions
as follows:

Da₁. ⌜μ β₁ β₂ ... βₙ₋₂ βₙ₋₁ α̂φ⌝ for ⌜(∃βₙ)(βₙ = α̂φ . μ β₁ β₂ ... βₙ₋₂ βₙ₋₁ βₙ)⌝,

Da₂. ⌜μ β₁ β₂ ... βₙ₋₂ α̂φ ζₙ⌝ for ⌜(∃βₙ₋₁)(βₙ₋₁ = α̂φ . μ β₁ β₂ ... βₙ₋₂ βₙ₋₁ ζₙ)⌝,

. . .

Daₙ. ⌜μ α̂φ ζ₂ ... ζₙ₋₂ ζₙ₋₁ ζₙ⌝ for ⌜(∃β₁)(β₁ = α̂φ . μ β₁ ζ₂ ... ζₙ₋₂ ζₙ₋₁ ζₙ)⌝.

Since from the standpoint of primitive notation any free occurrence
of a variable has an atomic matrix as its immediate context,
our definitions of the quasi-atomic contexts of ⌜α̂φ⌝ explain
abstracts in all manners of occurrence which are available to free
variables. Definitions which abbreviate contexts of free variables
can then be so framed as to apply at the same time to abstracts, as
already observed in the case of D10, D11, D13, and D14. All
occurrences of an abstract will turn out to stand in quasi-atomic
contexts when the overlying definitional abbreviations as of D10,
D11, D13, D14, etc. are resolved.

An occurrence of ⌜α̂φ⌝, in any quasi-atomic context, comes to
have an immediate context of one or other of the forms ⌜γ ε α̂φ⌝
and ⌜β = α̂φ⌝ when overlying abbreviations are resolved; this is
clear from D12 and Da₁–aₙ. Furthermore, ⌜β = α̂φ⌝ in turn is
short for

⌜(γ)(γ ε β .≡. γ ε α̂φ)⌝

where the immediate context of ⌜α̂φ⌝ is again of the form
⌜γ ε α̂φ⌝. The latter is thus the fundamental sort of context for
abstracts; every occurrence of ⌜α̂φ⌝, in any quasi-atomic context
and therefore in any context whatever, comes to have an immediate
context ⌜γ ε α̂φ⌝ on definitional expansion. Once ⌜γ ε α̂φ⌝ is
expanded in turn, by D9, the abstract ⌜α̂φ⌝ disappears altogether.

Inspection of D9 shows that all free occurrences of variables in φ
remain free in ⌜γ ε α̂φ⌝, except for occurrences of α; but that all
occurrences of α in φ are bound in ⌜γ ε α̂φ⌝. Moreover, this is
true not only in the case of ⌜γ ε α̂φ⌝ but in the case of any other
quasi-atomic context ψ of ⌜α̂φ⌝ as well; for we have noted that the
expansion of ψ by D12 or Da₁–aₙ (and D10) will merely give us
⌜γ ε α̂φ⌝ again within a broader context, and by the general
convention governing the choice of extra variables in definitions we
are assured that the quantifiers of this broader context will not
involve variables of φ. In all contexts, therefore, an abstraction
prefix ⌜α̂⌝ operates in the same way as a quantifier ⌜(α)⌝ so far
as bondage and freedom are concerned.

When we speak about a formula, we are ordinarily to think of it
as written out in full primitive notation (cf. § 13). But it is
sometimes convenient to speak of occurrences of an abstract ⌜α̂φ⌝ in a
formula ψ; and this usage does compel us to think of ψ as
temporarily subject to definitional abbreviations, to the extent of
accommodating the occurrences of ⌜α̂φ⌝ in question. The
abbreviations thus imagined in ψ are still conveniently limited to
quasi-atomic contexts of ⌜α̂φ⌝. Insofar as a formula ψ is discussed from
this point of view, questions of bondage and freedom call for
explicit consideration of abstraction prefixes along with quantifiers.
As an adjunct of discourse about occurrences of abstracts,
accordingly, the following practical elaboration of the terminology of
§ 14 is needed. A given occurrence of α is bound to a given
occurrence of ⌜(α)⌝ or ⌜α̂⌝ if it stands in a formula or abstract
beginning with the given occurrence of ⌜(α)⌝ or ⌜α̂⌝ and stands in
no formula or abstract beginning with a later occurrence of ⌜(α)⌝
or ⌜α̂⌝. An occurrence of a formula or term ν is bound in a formula
or term μ with respect to α if within that occurrence of ν there is an
occurrence of α which is bound to an occurrence of ⌜(α)⌝ or ⌜α̂⌝
lying in μ but outside ν. An occurrence of ν is bound in μ if it is
bound in μ with respect to at least one variable; and ν is bound
in μ if an occurrence of ν is bound in μ. An occurrence of ν in μ
is free in μ if it is not bound in μ; and ν is free in μ if an occurrence
of ν is free in μ. Clearly an occurrence of ν will be bound in μ
with respect to α, in the sense of the present paragraph, if and only
if the corresponding occurrence or occurrences of ν in the
primitive expansion of μ are bound with respect to α in the sense of
§ 14; and the rest of the terminology of the present paragraph
meshes with that of § 14 in similar fashion.
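
For the simplest case, that of single variables, the extended terminology comes to a short recursion in which an abstraction prefix is treated exactly like a quantifier. The Python sketch below is a simplified illustration and not the full definition: it computes only which variables are free in a formula or term, and it assumes a tuple encoding (a variable is a string, an abstract is ("abs", variable, formula), quantifications are ("all", variable, formula) and ("ex", variable, formula)) that is introduced here and is not the text's.

```python
def free_vars(e):
    """Set of variables having a free occurrence in the formula or term e."""
    if isinstance(e, str):
        return {e}                      # a variable standing alone is free
    op = e[0]
    if op in ("all", "ex", "abs"):
        # (α), (∃α) and the abstraction prefix α̂ all bind α in what follows
        return free_vars(e[2]) - {e[1]}
    result = set()
    for part in e[1:]:                  # 'in', 'eq', 'not', 'and', 'imp', 'iff'
        result |= free_vars(part)
    return result


def is_closed(e):
    """True if no variable is free in e."""
    return not free_vars(e)


# x̂(y ε x): y is free, x is bound by the prefix
OPEN_ABSTRACT = ("abs", "x", ("in", "y", "x"))

# x̂(x ε x): no free variables
CLOSED_ABSTRACT = ("abs", "x", ("in", "x", "x"))

# (y)(y ε x̂(y ε x)): the quantifier binds the y inside the abstract as well
QUANTIFIED = ("all", "y", ("in", "y", OPEN_ABSTRACT))
```

The third example shows a variable inside an abstract being bound by a quantifier lying outside it, the situation for which the notion ‘bound with respect to α’ is wanted.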

The following analogues of *121–*123 are typical of the
metatheorems which involve use of this extended terminology.

*186. If η′ is like η except for containing φ′ at some places where η
contains φ, and α₁, ..., αₙ exhaust the variables with respect to which
those occurrences of φ and φ′ are bound in η and η′, then

⊢ ⌜(α₁) ... (αₙ)(φ ≡ φ′) ⊃. η = η′⌝.

Proof: *121, *117 (& D10).

*187. If η′ is like η except for containing free occurrences of φ′ at
some places where η contains free occurrences of φ, then

⊢ ⌜φ ≡ φ′ .⊃. η = η′⌝.

Proof: *186 (taking n as 0).

*188. If ⊢ ⌜φ ≡ φ′⌝, and η′ is formed from η by putting φ′ for some
occurrences of φ, then ⊢ ⌜η = η′⌝.

Proof from *186 like that of *123 from *121.

But it must be borne in mind that the theorems provided by
such metatheorems as *186-*188 are still to be thought of finally
as in full primitive notation. *187, e.g., amounts properly to the
following: If η′ is like η except for containing free occurrences of φ′
at some places where η contains free occurrences of φ, then the
formula (in primitive notation) which the definitions abbreviate as
⌜φ ≡ φ′ .⊃. η = η′⌝ is one whose closure is a theorem.

The following theorem shows that ‘ŷ’ cancels ‘y ε’; every entity
is the class of all those elements which are its members.

†189. (x) x = ŷ(y ε x)

Proof. *100   y ε x .⊃. y ε x   (1)

*134   z ε x [. (y)1] .⊃ (∃w)(z ε w . (y)(y ε w .⊃. y ε x))   (2)

*103   (y)(y ε w .⊃. y ε x) ⊃: z ε w .⊃. z ε x   (3)

*100, *163   (∃w)([3 .] z ε w . (y)(y ε w .⊃. y ε x)) ⊃. z ε x   (4)

*140 (& D5, 9, 10)   189 [≡. (z)2 . (z)4]

The following two definitions introduce abbreviations for two 
particularly important abstracts. 

D15. ‘V’ for ‘x̂(x = x)’,

D16. ‘Λ’ for ‘x̂(x ≠ x)’.

V is, by definition, the class of all those elements which are
self-identical; i.e., since everything is self-identical (†182), V is
simply the class of all elements.


†189a. (x) x ε V .≡ (∃y)(x ε y)

Proof. *100   [182 ⊃:] x ε y .⊃. x = x   (1)

*100 (& D9, 15)   x ε V .≡ (∃y)(x ε y [. (x)1])

†190. (y)(x) x ε y .⊃. x ε V

†191. (y)(x) x ε y .≡. x ε V . x ε y

Proofs. *135   x ε y .⊃ (∃y)(x ε y)   (1)

*100   [189a . 1 .⊃] 190, 191

Λ is defined as the class of all those elements which are not
self-identical; hence it is the memberless class, the so-called null class.

†192. (x) x ε̃ Λ

Proof. *110   (x)(x ε y .⊃. x ≠ x) ⊃: x ε y .⊃. x ≠ x   (1)

*100   [1 . 182 .⊃] ~(x ε y . (x)(x ε y .⊃. x ≠ x))   (2)

*131 (& D9, 16)   192 [≡ (y)2]

Citation of the trivial definitions D11 and D13 will ordinarily be
omitted, as in the above proof. Similarly for D14, D6, and D7.

†192a. (y) y = Λ .≡ (x)(x ε̃ y)

Proof. *100, *123 (& D10)   y = Λ .≡ (x)(x ε̃ y [.≡ 192])

†192b. (y) y ≠ Λ .≡ (∃x)(x ε y)

Proof. *100 (& D8)   [192a ⊃] 192b

*193. ⊢ ⌜(α)φ ⊃. α̂φ = V⌝.

Proof. *103   ⊢ ⌜[(x)182 ⊃.] α = α⌝   (1)

*186   ⊢ ⌜(α)(φ [≡ 1]) ⊃. α̂φ = α̂(α = α)⌝

*171 (& D15)   ⊃. α̂φ = V⌝.


1 This short proof was suggested by I. Kaplansky and M. J. Norris.

*194. ⊢ ⌜(α)~φ ⊃. α̂φ = Λ⌝.

Proof. *103   ⊢ ⌜[(x)182 ⊃.] α = α⌝   (1)

*186   ⊢ ⌜(α)(~φ [≡ 1]) ⊃. α̂~~φ = α̂(α ≠ α)⌝

*171 (& D16)   ⊃. α̂~~φ = Λ⌝

*100, *123   ⊃. α̂φ = Λ⌝.

In the above proofs, *171 is used as if the biconditional ⌜φ ≡ φ′⌝
thereof were a mere conditional ⌜φ ⊃ φ′⌝. In the last step of the
proof of *194, again, *123 is used as if the ⌜ψ ≡ ψ′⌝ thereof were
⌜ψ ⊃ ψ′⌝. This procedure involves tacit application of *100, of
such kind as to weaken the biconditional to a conditional. Ellipsis
of this sort will be common hereafter; a biconditional ⌜φ ≡ ψ⌝
whose closure is cited as a theorem will be subject to any of the
variant readings ⌜ψ ≡ φ⌝, ⌜φ ⊃ ψ⌝, and ⌜ψ ⊃ φ⌝ at convenience.
But this innovation applies only to citations by starred and
daggered numerals; divorced of its star or dagger and used as an
abbreviation, the numeral continues to stand for just the one
expression for which it has hitherto stood.
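
What †189, *193, and *194 assert can be checked by hand in a small finite model. The Python sketch below does this under assumptions that are entirely its own: the universe consists of three finite sets, each of which is counted as an element; identity is Python's equality of frozensets, which is extensional as D10 requires; and an abstract is read as the set of those members of the universe which satisfy a condition. It illustrates the content of the theorems and proves nothing about the system.

```python
def abstract(universe, phi):
    """α̂φ over a finite universe: the class of elements satisfying phi."""
    return frozenset(a for a in universe if phi(a))


# A small universe, closed under membership (every member of a member
# of UNIVERSE is itself in UNIVERSE). The names are hypothetical.
ZERO = frozenset()
ONE = frozenset({ZERO})
TWO = frozenset({ZERO, ONE})
UNIVERSE = [ZERO, ONE, TWO]

# D15 and D16 relative to this universe
V = abstract(UNIVERSE, lambda x: x == x)
LAMBDA = abstract(UNIVERSE, lambda x: x != x)


def cancels(x):
    # †189: x = ŷ(y ε x)
    return abstract(UNIVERSE, lambda y: y in x) == x
```

A condition satisfied by everything in the universe yields V, and one satisfied by nothing yields Λ, which is what *193 and *194 say.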

In precisely the sense in which D9 and D12 constitute contextual definitions of
⌜α̂φ⌝, D1–5 may be said to constitute contextual definitions of ‘~’, ‘.’, ‘∨’, ‘⊃’,
and ‘≡’; likewise D8 of ‘∃’, and D10 of ‘=’. The special subtlety of D9 and D12
is merely this: they endow ⌜α̂φ⌝ with contexts notationally similar to contexts
primitively accessible to variables, thus enabling us to handle ⌜α̂φ⌝ as a term
(unlike ‘~’, ‘∃’, ‘=’, etc.). Such contextual definition of terms was initiated
by Russell (1905, 1908) in his introduction of descriptions (§ 27); I have preferred
to apply the method to abstracts, however, and then explain descriptions as direct
abbreviations of certain abstracts. (Russell’s theory of classes also proceeds by
contextual definition, but in a fashion irrelevant to present concerns; cf. footnote
to § 22.) Russell’s method of contextual definition differs in detail from that
exhibited above, and is more complex; it depends, for the avoidance of certain
ambiguities which would otherwise arise, upon use of an auxiliary scheme of
prefixes (cf. Principia, *14). The fact that Russell is concerned with descriptions
rather than abstracts is not responsible for those complications; in “New
Foundations,” D9-10, I have defined description along lines similar to the above
contextual definitions of abstraction.

The so-called universal and null classes V and Λ played a prominent role in the
class algebra developed by Boole and his successors (§ 33). The signs ‘V’ and ‘Λ’
are Peano’s. But the present use of ‘V’ as a means of affirming elementhood is
new.


§ 27. Descriptions and Names

WHEREAS the prefix ‘x̂’ corresponds to the words ‘the class of
all elements x such that’, the prefix ‘(℩x)’ is to correspond rather
to the words ‘the one and only object (entity) x such that’. This
idiom ‘(℩x)(... x ...)’, i.e.:

(1) the object x such that ... x ...,

is known as description. It has now to be defined.

Actually, the meaning of a description is not always clear under
ordinary usage. The meaning of the description ‘(℩x)(x is capital
of France)’, or ‘the object x such that x is capital of France’, or
briefly ‘the capital of France’, is clear enough; this description is a
complex name which designates the city of Paris. Again the
meaning of ‘(℩x)(5 + x = 9)’ is clear; this description designates
the number 4. But what of the descriptions ‘(℩x)(9 > x)’,
‘(℩x)(x ≠ x)’, ‘(℩x)(Jones loves x)’? We cannot say that ‘(℩x)
(9 > x)’ designates the number less than 9, for there are many
such numbers. We cannot say that ‘(℩x)(x ≠ x)’ designates the
object which is distinct from itself, for there is no such thing. We
cannot say that ‘(℩x)(Jones loves x)’ designates the person loved
by Jones, for we do not know whether there is such a person;
Jones may love several, or none at all. Ordinary usage gives us
no clue as to the meaning of such descriptions, since use of the
idiom (1) is ordinarily limited to cases where one and only one
object is believed to satisfy the condition ‘... x ...’. Everyday use
of descriptions is indeed often elliptical, essential parts of the
condition ‘... x ...’ being left understood; thus we may say simply
‘the yellow house’ (i.e. ‘(℩x)(x ε yellow . x ε house)’) when what
is to be understood is rather ‘the yellow house in the third block
of Lee Street, Tulsa.’ But when in ‘... x ...’ there is nothing
either expressed or tacitly understood which is adequate to
determining the object x (to the best of the speaker’s belief), the idiom
(1) is not ordinarily used.

Does this mean that we should reject the notation ⌜(℩α)φ⌝ as
meaningless in those cases where φ is satisfied by no object or by
many, rather than by exactly one? This would fit ordinary usage
of the idiom (1), but from the standpoint of logical analysis such a
formulation is awkward. It is awkward, in general, to let
questions of meaningfulness or meaninglessness rest upon casual
matters of fact which are not open to any systematic and
conclusive method of decision. We may never know whether Jones loves
none, one, or many; and it is best not to have to wait for that
information in order to decide whether to accord the expression
‘(℩x)(Jones loves x)’ a place in our language. The truth or
falsehood of statements must indeed wait, in general, upon inquiries
which lack any systematic and conclusive technique; but the
meaningfulness of an expression (the eligibility of an expression
to occur in statements at all, true or false) is a matter over which
we can profitably maintain control. Let us then so define ⌜(℩α)φ⌝
as to provide for its significant use regardless of the number of
objects happening to satisfy the condition φ. In accordance with
(1) we want (℩x)(... x ...) to be the sole object x such that
... x ... so long as there is such an object; but in the remaining
cases we may construe (℩x)(... x ...) in any arbitrary fashion that
proves convenient (let us say as Λ), since ordinary usage affords
no preconceptions regarding such cases.

It will be found shortly (*196, *197) that such a version of (℩x)
is achieved by the following definition.

D17. ⌜(℩α)φ⌝ for ⌜β̂(∃γ)(β ε γ . (α)(α = γ .≡ φ))⌝.

The choice of β and γ in D17 is subject to the obvious convention:
they are to be distinct from each other, from α, and from the
variables of φ.
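
The effect of D17 can be seen by evaluating its right-hand side literally over a small finite model. The Python sketch below is illustrative only and rests on assumptions of its own: every object of the universe is a finite set, the quantifiers range over a three-object universe, and identity is equality of frozensets. The names describe, UNIVERSE, and so on are introduced here and are not the text's.

```python
def describe(universe, phi):
    """(℩α)φ read off D17: the class of all β such that β ε γ for some γ
    with (α)(α = γ .≡ φ). Empty when no such γ exists."""
    return frozenset(
        beta
        for gamma in universe
        if all((alpha == gamma) == phi(alpha) for alpha in universe)
        for beta in gamma
    )


# Hypothetical three-object universe of finite sets
ZERO = frozenset()
ONE = frozenset({ZERO})
TWO = frozenset({ZERO, ONE})
UNIVERSE = [ZERO, ONE, TWO]
LAMBDA = frozenset()          # the null class

# Exactly one object has two members, so the description picks it out.
unique_case = describe(UNIVERSE, lambda a: len(a) == 2)

# Two objects have fewer than two members, so the description falls to Λ.
many_case = describe(UNIVERSE, lambda a: len(a) < 2)

# Nothing is distinct from itself, so again the description falls to Λ.
none_case = describe(UNIVERSE, lambda a: a != a)
```

When exactly one γ satisfies the condition, the class of its members is γ itself; in the other two cases no γ qualifies and the class is empty, as the definition was meant to arrange.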

Just as the prefix ‘ŷ’ cancels ‘y ε’ (cf. †189), so the prefix ‘(℩y)’
cancels ‘y =’. This is shown in the following theorem.

†195. (x) x = (℩y)(y = x)


Proof. †185, *123   [189 ≡.] x = ŷ(∃z)(y ε z . z = x)   (1)

*171   [1 ≡.] x = ŵ(∃z)(w ε z . z = x)   (2)

*103   [(x)182 ⊃.] z = z   (3)

*103   (y)(y = z .≡. y = x) ⊃: [3 ≡.] z = x   (4)

*121 (& D10), *117   z = x .⊃ (y)(y = z .≡. y = x)   (5)

*121 (& D5, 17)   [(z)(5 . 4) ⊃. 2 ≡] 195

The bracketing in the last line, above, can be resolved into steps
as follows:

(z)(5 [. 4]) ⊃. 2 ≡ 195,     [(z)5 ⊃.] 2 ≡ 195,     [2 ≡] 195.

The matrix ‘(x)(x = y .≡ ... x ...)’, according to which
‘... x ...’ is true of anything x just in case x is y, constitutes a
simple symbolic translation of the words ‘y is the sole object x
such that ... x ...’. Now we want (℩x)(... x ...) to be this
object y, if such there be; and otherwise we want (℩x)(... x ...)
to be Λ. I.e., the closures of:

(x)(x = y .≡ ... x ...) ⊃. y = (℩x)(... x ...)

and:

~(∃y)(x)(x = y .≡ ... x ...) ⊃. (℩x)(... x ...) = Λ

are wanted as theorems. That D17 achieves these objectives is
established in the following two metatheorems.

*196. If β is distinct from α,

⊢ ⌜(α)(α = β .≡ φ) ⊃. β = (℩α)φ⌝.

Proof. *171 (& hp)   ⊢ ⌜[(x)195 ≡] (β)(β = (℩α)(α = β))⌝.   (1)

*110   ⊢ ⌜[1 ⊃.] β = (℩α)(α = β)⌝.   (2)

*121   ⊢ ⌜(α)(α = β .≡ φ) ⊃: [2 ≡.] β = (℩α)φ⌝.

*197. If β is not α nor free in φ,

⊢ ⌜~(∃β)(α)(α = β .≡ φ) ⊃. (℩α)φ = Λ⌝.

Proof. Let γ be new.

*156   ⊢ ⌜(∃β)(γ ε β . (α)(α = β .≡ φ)) ⊃.
             (∃β)(γ ε β) . (∃β)(α)(α = β .≡ φ)⌝.   (1)

*100, *117   ⊢ ⌜~(∃β)(α)(α = β .≡ φ) ⊃ (γ)([1 ⊃] ~(∃β)(γ ε β . (α)(α = β .≡ φ)))⌝

*194 (& D17 & hp)   ⊃. (℩α)φ = Λ⌝.

Since (℩x)(... x ...) as of D17 is patently a class, whereas the
one and only object fulfilling ‘... x ...’ may be any sort of object,
the identification of the two will perhaps seem strange; but only
in case the reader has lost sight of the assimilation of non-classes
to classes (§ 25). An individual is a class, for it is identical with the
class whereof it is sole member. Or, if the reader prefers the
equivalent but less convenient phrasing, according to which an
individual is neither a class nor the sole member of a class, then he
must recognize that an abstract may in particular designate an
individual rather than a class; so that there is again no strangeness
in the identification of (℩x)(... x ...) with an individual. In any
case the argument in *196 is completely general, and shows that
(℩x)(... x ...) is the one and only object x such that ... x ...
whenever there is such a unique object.

Frege (Grundgesetze, vol. 1, p. 19) was the first to offer an explicit formulation
of description. His version is equivalent to that provided by D17 except for the
treatment of those uninteresting cases where φ is fulfilled by more than one value
of α; in such cases he arbitrarily gives ⌜(℩α)φ⌝ the sense of ⌜α̂φ⌝. Still a third
version of ⌜(℩α)φ⌝, equivalent both to Frege’s and to D17 except for those waste
cases, is provided by defining ⌜(℩α)φ⌝ in the very simple fashion ⌜β̂(∃α)(β ε α . φ)⌝;
but I have preferred so to frame D17 as to identify the waste cases uniformly with
Λ.

The notation ⌜(℩α)φ⌝, somewhat different from Frege’s, was adapted from
Peano by Russell. The sign ‘℩’ is supposed to suggest a certain inverseness to the
notion of unit class expressed by ‘ι’ (§ 35).

Concerning the philosophical background of description theory, the limited
remarks of the present section may well be supplemented by reading Russell,
“On Denoting.”

All the names we ever need, for pure logic and mathematics or
for any other sort of discourse, are adequately provided by
abstraction. For, suppose there is a name, say ‘Europe’, which is
not defined as an abbreviation of an abstract but is instead simply
a primitive term of geography. By a trivial revision of our
geographical primitives and definitions we can reconstrue this name
as an abstract, and in particular as a description (which is a
special sort of abstract, by D17). This is accomplished as follows.
Instead of adopting the name ‘Europe’ as primitive, we may adopt
a primitive form of atomic matrix ⌜eur α⌝ having the following
sense: ‘eur x’, taken as a whole, amounts to what would have been
expressed in ordinary language as ‘x is Europe’. Now we can
introduce the name ‘Europe’ by definition as an abbreviation of
‘(℩x) eur x’; for if there is one and only one object x such that
eur x then by *196 that object is (℩x) eur x. The predicate ‘eur’
itself is not a name, but a fragmentary sign like ‘ε’, ‘=’, ‘~’, ‘ ’;
it can occur only attached, in contexts of the form ⌜eur α⌝.

There is little point, one may suppose, in avoiding primitive
names by such artificial methods. Why is it better to adopt the
predicate ‘eur’, and then define ‘Europe’, than to adopt ‘Europe’
outright? Actually the suggested course has, despite such
artificiality, a considerable theoretical advantage; for the following
difficulty otherwise arises in connection with the notion of
existence.

To say that something does not exist, or that there is something
which is not, is clearly a contradiction in terms; hence ‘(x)(x
exists)’ must be true. Moreover, we should certainly expect leave
to put any primitive name of our language for the ‘x’ of any
matrix ‘... x ...’, and to infer the resulting singular statement
from ‘(x)(... x ...)’; it is difficult to contemplate any alternative
logical rule for reasoning with names. But this rule of inference
leads from the truth ‘(x)(x exists)’ not only to the true conclusion
‘Europe exists’ but also to the controversial conclusion ‘God
exists’ and the false conclusion ‘Pegasus exists’, if we accept
‘Europe’, ‘God’, and ‘Pegasus’ as primitive names in our
language. The atheist seems called upon to repudiate the very name
‘God’, thus depriving himself of vocabulary in which to affirm his
atheism; and those of us who disbelieve in Pegasus would seem to
be in a similar position.

But this difficulty is resolved by the suggested procedure of
adopting matrices ⌜eur α⌝, ⌜god α⌝, ⌜peg α⌝, and then introducing
‘Europe’, ‘God’, and ‘Pegasus’ as abbreviations of ‘(℩x) eur x’,
‘(℩x) god x’, and ‘(℩x) peg x’. There is no question of the
existence of these three entities; there is question only as to their
nature. If there is a unique object x such that god x, i.e. if

(1) (∃y)(x)(x = y .≡ god x),

then (℩x) god x is that object; and otherwise, by *197,

(2) (℩x) god x = Λ.

Monotheists and atheists now need disagree only on the truth values
of statements such as (1) and (2), not on questions of
meaningfulness. The artificial dodge of dispensing with primitive names in
favor of descriptions or other abstracts is a way of maintaining
control over questions of vocabulary independently of questions of
fact.

Russell undertook to resolve the anomalies of existence by admitting the word
‘exists’ only in connection with descriptions, and explaining the whole context
‘(℩x)(... x ...) exists’ as short for ‘(∃y)(x)(x = y .≡ ... x ...)’ (cf. his
“Mathematical Logic,” p. 253; Principia, pp. 66 ff, 174 f). This course supplies a strict
technical meaning for Kant’s vague declaration that ‘exists’ is not a predicate;
namely, ‘exists’ is not grammatically combinable with a variable to form a matrix
‘y exists’. Russell’s version of descriptional existence statements perhaps
recommends itself still as an adjunct to the procedure which I have proposed above;
alternatively such existence statements forfeit their usual function and fall into
uniform triviality, since ⌜(℩α)φ⌝ designates something (viz. Λ) even when
⌜(∃β)(α)(α = β .≡ φ)⌝ is false.

The difficulty which we noted earlier in connection with existence would have 
survived in another form even if the usage ‘ (x)(x exists)’ had been proscribed at 
the start in conformity with Russell’s dictum. Independently of any predicate 
‘exists’, we presuppose that a noun designates something whenever we deduce a 
singular statement from a universal quantification by substituting the noun for 
the variable. The quantification makes an affirmation regarding all entities, and we 
assume that the substituted noun designates one of those entities. So long as there 
are primitive expressions whose possession of designata is undecided, the logic of 
quantification remains indeterminate. 

Names are indeed describable in general as expressions which (unlike ‘(’, ‘↓’,
‘ε’, ‘eur’, etc.) admit thus of substitution for variables in deriving singular
statements from quantifications. Cf. my “Designation and Existence,” “Logistical
Approach.” The sort of primitive notation which I urge is free from such
expressions; they enter afterward as abstracts by contextual definition, and the principle
allowing their substitution for variables emerges as a metatheorem (*231, § 31).

Such avoidance of primitive names, in the construction not only
of the purely logical language but also of formalized languages of
broader scope, has incidentally the further advantage of making
the quantification theory of Chapter II adequate for all
extra-logical as well as purely logical applications of quantification. If
primitive names were allowed, the quantificational axioms of the
kind D (cf. § 15, also *103) would have to be extended to include
statements in which names occur in place of the variable α′. But
we have seen that such names can be avoided; languages for any
extra-logical purposes can presumably be formed merely by
supplementing our primitive logical notations by indefinitely
many extra-logical predicates such as ‘eur’, ‘red’, etc. Such signs
are of the same category as ‘ε’; each attaches to one or more
variables to produce an atomic formula.

For us, in any case, all names will be abstracts. The numerals
‘0’, ‘1’, etc., and indeed all expressions which are to behave in the
manner of names at all, will turn out in the course of the definitions
to be abbreviations of descriptions or other abstracts.

Altogether we have three sorts of terms, or substantives: there
are variables, which are essentially pronouns (cf. § 12), and there
are abstracts with and without free variables. Abstracts without
free variables, e.g. ‘x̂(x ε x)’, ‘x̂(x ε̃ x)’, ‘x̂(x = x)’ (or ‘V’), and
‘x̂(x ≠ x)’ (or ‘Λ’), are names of definite classes. But abstracts
with free variables, e.g. ‘x̂(y ε x)’ and ‘x̂(y ε x . x ε z)’, are not.
They amount to substantive phrases with unattached pronouns;
e.g., ‘x̂(y ε x)’ corresponds to the phrase ‘the class of all those
elements whereof it is a member’. Such expressions might be classed
as name matrices, for they are related to names as statement
matrices are related to statements. Variables themselves may also
conveniently be classed as name matrices.

Thus, just as formulae comprise statements and statement matrices,
so terms comprise names and name matrices. A formula without
free variables is a statement, and a term without free variables is a
name. All names are abstracts, but not conversely; and all
variables are name matrices but not conversely. From the
standpoint of primitive notation our only terms are variables, pronouns;
all abstracts, and in particular all names, admit ultimately of
elimination through D9.


§ 28. Stratification


WE CAN decide whether or not a statement is an axiom of
quantification by examining its structure with regard merely to
quantifiers and statement connectives (ultimately, quantifiers and ‘↓’);
the question is independent of the properties of ‘ε’. Again, we
can decide whether or not a statement is a ponential of two given
statements by examining the structure of the statements with
regard merely to ‘⊃’, hence ultimately ‘↓’. The notion of
theorem hitherto defined turns, therefore, merely upon structure in
terms of quantifiers and joint denial. Indeed, the notion of
theorem was explained before membership had been discussed at
all. The theorems are true statements which involve only
quantifiers and ‘↓’ essentially; ‘ε’ occurs vacuously. Even the
theorems of Chapter III relating to membership, or to the derivative
notions of identity, abstraction, and description, are thus
independent of any special properties of membership; they would
remain true if the sign ‘ε’ (throughout the primitive expansions of
those theorems) were reconstrued as meaning ‘is not a member
of’, or ‘is older than’, or ‘loves’, or anything else we like.

We have yet to consider the logical truths which depend for
their truth upon the meaning of ‘ε’, rather than merely upon the
quantifiers and joint denial. In order to provide for such truths, a
new infinite set of axioms will be introduced by way of supplement 
to the axioms of quantification. Since they are various of the 
logical truths which involve ‘e’ essentially, these new axioms will 
be called axioms of membership. The notion of theorem will then be 
extended to cover all ponentials of ponentials of. . . ponentials of 
axioms in general, whether of quantification or of membership 
(rather than just those of quantification). 

Axioms are wanted specifying circumstances under which a class
is to be an element. What conditions shall we regard as sufficient
for elementhood? Common sense is of no avail in this question;
common sense would construe every class indifferently as an
element, i.e. as capable of membership in another class, whereas we
already know that this status must be withheld from the classes

(1) x̂(x ε̃ x), x̂(y)~(x ε y . y ε x), x̂(y)(z)~(x ε y . y ε z . z ε x),

etc. (cf. § 24). It remains natural, however, to construe the range
of elements as liberally as possible. Wherever ⌜α̂φ⌝ names a class
which is not by necessity barred from elementhood as are the
classes cited in (1), the statement ⌜α̂φ ε V⌝ recommends itself as an
axiom of membership.

It might appear reasonable, then, to specify as axioms of membership
all those statements ⌜α̂φ ∈ V⌝ whose denials are not theorems.
But there are objections to this course. For one thing, we
must decide whether the notion of theorem involved in this description
is to be understood in the narrower or the broader sense:
ponentials of . . . ponentials of axioms of quantification, or ponentials
of . . . ponentials of axioms of quantification and membership.
The broader sense would vitiate the suggested explanation of
axioms of membership, through circularity; for the term ‘theorem’
in the explanation would presuppose that we already knew
what the axioms of membership were to be. We must therefore
choose the narrower sense. There remains, however, an unanswerable
objection of another kind: the lack of a test. Even when
‘theorem’ is taken in the narrower sense, there is no process for
deciding that a given formula is not a theorem (cf. § 16); yet this
decision would have to be made before we could be sure that
⌜α̂φ ∈ V⌝ is an axiom of membership in the proposed sense. Alleged
axioms of membership of this kind would never be conclusively
recognizable as such, and proofs based on them would thus be
impossible. What is wanted rather is a characterization of axioms
of membership according to which those axioms will always be
recognizable. The condition imposed upon φ and α, in order that
⌜α̂φ ∈ V⌝ be an axiom of membership, must be a condition which we
can check.

So the best we can do is seek some testable condition on φ and α
which will serve to include among the axioms of membership as
many harmless statements ⌜α̂φ ∈ V⌝ as is conveniently possible,
while excluding any actually refutable ones. Looking to (1) for a
suggestion of the forms to be excluded, we are led to suspect that
the essence of the bad cases of ⌜α̂φ ∈ V⌝ is the presence in φ of the
cyclic type of clause

⌜α ∈ γ₁ . γ₁ ∈ γ₂ . … . γₙ ∈ α⌝

from α around to α. We are tempted to take as axioms of membership
all statements ⌜α̂φ ∈ V⌝ such that φ is free from that
cyclic feature. Actually, however, this restriction is too mild;
bad cases turn up which are free from the cyclic feature. Consider,
e.g., the abstract:

(2) x̂ ~(y)(x = y .⊃. x ∈ y).

The quantification therein, viz:

(y)(x = y .⊃. x ∈ y),

says that x is a member of whatever it is identical with; in other
words, that x is a member of itself. Thus the class which (2) designates
is our familiar non-element x̂(x ∉ x); yet (2) does not, like
‘x̂(x ∉ x)’, exhibit any cyclic situation. Even when expanded into
primitive notation, the formula involved in (2) turns out to contain
no atomic formulae beyond ‘(z ∈ x)’, ‘(z ∈ y)’, and ‘(x ∈ y)’;
and these materials are incapable of being put into any cyclic
arrangement.

But there is another characteristic, broader than the cyclic one,
which is exhibited by the cyclic sorts of context and by (2) as well;
namely, it is impossible to put numerals for the variables in such a
way that ‘∈’ comes to occur always in the manner ‘n ∈ n + 1’. This
is the characteristic upon which exclusions of statements ⌜α̂φ ∈ V⌝
from the axioms of membership will be based.

Let us speak of a formula as stratified if it is possible to put
numerals for its variables (the same numeral for all occurrences of
the same variable) in such a way that ‘∈’ comes to be flanked always
by consecutive ascending numerals (‘n ∈ n + 1’). To show
e.g. that the matrix:

(3) (y)(x ∈ y .⊃ (z)(y ∈ z .≡. x ∈ y))

is stratified, we simply put suitable numerals for the variables:

(4) (1)(0 ∈ 1 .⊃ (2)(1 ∈ 2 .≡. 0 ∈ 1)).

The expression (4) is of course not supposed to mean anything;
it is not a statement or matrix or name, but only a diagrammatic
instrument like the truth table.

The property of stratification can obviously be rephrased in
terms of a direct test. Given any formula φ, the following procedure
will decide whether or not it is stratified. First we put ‘0’
for all occurrences of some one arbitrary variable. Then we put
numerals (positive, zero, or negative) for further variables, step by
step, according to this rule: if ⌜∈ α⌝ appears preceded by a numeral,
put the next higher numeral for all occurrences of α; and if
⌜β ∈⌝ appears followed by a numeral, put the next lower numeral
for all occurrences of β. If in the course of this process an occurrence
of ‘∈’ comes to be preceded by a numeral and followed by a
numeral which is not the next higher, then we stop; φ is unstratified.
If the process yields no such result, and some variables still
remain unsupplanted after the process has been carried as far as
possible, then we put ‘0’ for an arbitrary one of these residual
variables and continue. Having continued thus until all variables
are supplanted, and still not having found φ unstratified, we conclude
that φ is stratified.
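The test just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not part of the system: a formula is represented simply by the list of its atomic formulae, each given as a pair `(a, b)` standing for ‘a ∈ b’, and the function name `stratify` is hypothetical. Quantifiers and truth functions are ignored, since only the atomic formulae bear on stratification.

```python
from collections import deque

def stratify(atoms):
    """Return {variable: numeral} if the atoms are stratified, else None.

    `atoms` is an iterable of pairs (a, b), each standing for 'a ∈ b'.
    """
    neighbours = {}
    for a, b in atoms:
        neighbours.setdefault(a, []).append((b, +1))  # right of '∈': next higher
        neighbours.setdefault(b, []).append((a, -1))  # left of '∈': next lower
    level = {}
    for start in neighbours:
        if start in level:
            continue
        level[start] = 0              # put '0' for an arbitrary residual variable
        queue = deque([start])
        while queue:
            v = queue.popleft()
            for w, step in neighbours[v]:
                want = level[v] + step
                if w not in level:
                    level[w] = want   # supplant all occurrences of w
                    queue.append(w)
                elif level[w] != want:
                    return None       # '∈' flanked by non-consecutive numerals
    return level

# Matrix (3): atoms x ∈ y, y ∈ z, x ∈ y
print(stratify([("x", "y"), ("y", "z"), ("x", "y")]))
# Atoms of (2) in primitive notation: z ∈ x, z ∈ y, x ∈ y
print(stratify([("z", "x"), ("z", "y"), ("x", "y")]))
```

On the atoms of (3) the sketch returns the numerals used in (4); on the atoms of (2), as on ‘x ∈ x’ or any cyclic clause, it returns `None`.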

The notion of stratification is formulated for application to
formulae in primitive notation; hence we must not undertake to
show that a formula φ is stratified until we have eliminated all
definitional abbreviations which conceal any atomic formulae
⌜α ∈ β⌝. The abbreviations introduced by D1-8 need not be disturbed;
for the set of atomic formulae ⌜α ∈ β⌝ occurring in φ is unaffected
by those definitions, and it is only on that set of atomic
formulae that the question of stratification of φ depends. Thus no
definitional expansion was needed in showing stratification of (3).
But abbreviations introduced by D9-17 and subsequent definitions
would have to be eliminated. E.g., to test the formula in (2) for
stratification we must first expand ‘x = y’ by D10; the whole becomes:

~(y)((z)(z ∈ x .≡. z ∈ y) ⊃. x ∈ y),

which proves unstratified by the test of the preceding paragraph.

Assumption of ⌜α̂φ ∈ V⌝ for stratified φ does not attribute elementhood
directly to the classes described in (1) and (2), but we
must beware still of indirect consequences. If we were to take as
axioms of membership all statements ⌜α̂φ ∈ V⌝ with stratified φ, and
all closures of matrices ⌜α̂φ ∈ V⌝ with stratified φ, then we should
encounter trouble. E.g., since ‘x ∈ y’ is stratified, we could assert:

(5) (y) x̂(x ∈ y) ∈ V;

i.e., by †189, (y)(y ∈ V). All classes, including those described in
(1) and (2), would thus be admitted as elements after all.

So stratification of φ is not to be sufficient for truth of the
closure of ⌜α̂φ ∈ V⌝. Where φ is stratified, we might try assuming
not that the closure of ⌜α̂φ ∈ V⌝ is true but only that ⌜α̂φ ∈ V⌝ is
true for all elements as values of its free variables; i.e., that

(6) ⌜(β₁) . . . (βₙ)(β₁, β₂, . . . , βₙ ∈ V .⊃. α̂φ ∈ V)⌝

is true where α, β₁, β₂, . . . , βₙ exhaust the free variables of φ.
Then in particular we would affirm not (5) but:

(y) y ∈ V .⊃. x̂(x ∈ y) ∈ V,

which is quite harmless (and a mere corollary, indeed, of †189).

In the first edition of this book the axioms of membership, in so
far as they concern elementhood, were left as described above;
viz., (6) for all stratified φ. However, in 1942 Rosser showed,
by a deduction too complex for inclusion here, that the resulting
system was inconsistent. A recent analysis by Wang suggests that
we can put the system to rights if we not only restrict the free
variables of ⌜α̂φ ∈ V⌝ to elements as in (6), but also restrict the
bound variables of φ itself to elements. This can be done by
changing each quantification ⌜(γ)χ⌝ within φ to ⌜(γ)(γ ∈ V .⊃ χ)⌝,
and each quantification ⌜(∃γ)χ⌝ to ⌜(∃γ)(γ ∈ V . χ)⌝.

Let us then take as axioms of membership all statements of the
form (6), i.e. the closures of all matrices of the form

(7) ⌜β₁, β₂, . . . , βₙ ∈ V .⊃. α̂φ ∈ V⌝

such that φ has no free variables but α, β₁, β₂, . . . , βₙ and is formed
from a stratified formula by restricting all bound variables to elements.
E.g., since the formula:

(∃y)(x ∈ y . (z)(y ∈ z .⊃. z ∈ w))

is stratified, the following is an axiom of membership:

(8) (w) w ∈ V .⊃ x̂(∃y)(y ∈ V . x ∈ y . (z)(z ∈ V .⊃: y ∈ z .⊃. z ∈ w)) ∈ V.

Again, since (3) is stratified, this is an axiom of membership:

x̂(y)(y ∈ V .⊃: x ∈ y .⊃ (z)(z ∈ V .⊃: y ∈ z .≡. x ∈ y)) ∈ V.

Here there is nothing to correspond to the antecedent of (7), because
there are no free variables; the n of (7) is here 0. In general we are
to understand (7) thus as simply losing its antecedent when n = 0.
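The restriction of bound variables to elements is a purely mechanical rewriting, and a short Python sketch may make it concrete. The representation is an assumption made only for illustration: formulae are nested tuples such as `("in", "x", "y")` for ‘x ∈ y’, `("all", v, body)` and `("ex", v, body)` for the two quantifiers, and `"V"` is used as a bare name for the class of all elements. The function name `restrict_bound` is hypothetical.

```python
def restrict_bound(f):
    """Restrict every bound variable of formula `f` to elements of V."""
    op = f[0]
    if op == "in":                     # atomic 'a ∈ b': nothing to restrict
        return f
    if op == "all":                    # (γ)χ   becomes  (γ)(γ ∈ V .⊃ χ)
        _, v, body = f
        return ("all", v, ("imp", ("in", v, "V"), restrict_bound(body)))
    if op == "ex":                     # (∃γ)χ  becomes  (∃γ)(γ ∈ V . χ)
        _, v, body = f
        return ("ex", v, ("and", ("in", v, "V"), restrict_bound(body)))
    # truth-functional compound ("not", "and", "imp", "iff"): recurse
    return (op,) + tuple(restrict_bound(g) for g in f[1:])

# The stratified formula (∃y)(x ∈ y . (z)(y ∈ z .⊃. z ∈ w))
example = ("ex", "y",
           ("and", ("in", "x", "y"),
                   ("all", "z", ("imp", ("in", "y", "z"), ("in", "z", "w")))))
restricted = restrict_bound(example)
print(restricted)
```

Applied to the stratified formula of the example, the sketch yields the formula that stands under the abstraction prefix in (8). Stratification is to be tested on the formula before restriction, as the text requires.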

Strictly all formulae are to be thought of as in primitive notation.
But then, since in primitive notation all quantifiers are universal,
the restriction of a bound variable to elements consists strictly in
changing only ⌜(γ)χ⌝ to ⌜(γ)(γ ∈ V .⊃ χ)⌝ and never ⌜(∃γ)χ⌝ to
⌜(∃γ)(γ ∈ V . χ)⌝. Properly thus the restriction of bound variables
turns ⌜(∃γ)χ⌝, or ⌜~(γ)~χ⌝, into

(9) ⌜~(γ)(γ ∈ V .⊃ ~χ)⌝

rather than ⌜(∃γ)(γ ∈ V . χ)⌝. However, (9) in turn becomes
⌜(∃γ)(γ ∈ V . χ)⌝ by *100 and *123 (and D8); so we may in practice
conveniently treat (8) and similar cases directly as axioms of membership,
the excursion through D8, *100, and *123 being left tacit.
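The tacit excursion can be written out in two steps. What follows is a sketch in modern LaTeX notation (using `amsmath`), not the book's own layout; the justifications are those named in the text.

```latex
\begin{align*}
{\sim}(\gamma)(\gamma \in V \supset {\sim}\chi)
  &\equiv {\sim}(\gamma){\sim}(\gamma \in V \,.\, \chi)
  && \text{truth functions (*100), interchange (*123)} \\
  &\equiv (\exists\gamma)(\gamma \in V \,.\, \chi)
  && \text{definition of } \exists \text{ (D8)}
\end{align*}
```

The first step rests on the truth-functional equivalence of ‘p ⊃ ~q’ with ‘~(p . q)’.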

§ 29. Further Axioms of Membership 

IF x and y are the same thing, then whatever can be said of x is
naturally true likewise of y. Closures of conditionals of the sort:

x = y .⊃: . . . x . . . .≡. . . . y . . .

should thus be true, ‘. . . x . . .’ and ‘. . . y . . .’ being thought of as
formulae differing only in free occurrences of ‘x’ and ‘y’. The
following, accordingly, is wanted as a metatheorem:

(1) If φ' is like φ except for containing free occurrences of α'
in place of some free occurrences of α, then ⊢ ⌜α = α' .⊃. φ ≡ φ'⌝.

This is the principle of the substitutivity of identity, in one of its
forms. Broader forms of the principle are likewise wanted, dealing
with terms ζ and ζ' in general rather than merely with variables
α and α'. Now in order to provide for (1), and its extension to
terms ζ and ζ', we must supplement our axioms.

In specifying these new substitutivity axioms it is sufficient to
limit our attention to (1); the extension to terms ζ and ζ' can be
accomplished afterward (§ 30) without further assumption. Nor
do we need to take as axioms all the statements covered by (1);
it turns out that we need take only the cases where φ is atomic.
Also it turns out that the occurrences of α replaced by α' can be
limited to a single one; and ‘≡’ can be weakened to ‘⊃’. Freedom
of the occurrences of α and α' in φ and φ' no longer needs to be
stipulated, since an occurrence of a variable in an atomic formula is
necessarily free. The axioms are therefore describable simply
as the closures of formulae

(2) ⌜α = α' .⊃. φ ⊃ φ'⌝

such that φ is atomic and φ' is formed from φ by putting α' for an
occurrence of α. Thus two of them are:

(3) (z)(y)(x) x = y .⊃: x ∈ z .⊃. y ∈ z,

(4) (z)(y)(x) x = y .⊃: z ∈ x .⊃. z ∈ y;

and there are further ones corresponding to atomic matrices φ of
any non-logical forms, i.e. forms other than ⌜β ∈ γ⌝.

In view of D10, the axiom (4) and all others sharing the same
form

(5) ⌜(α)(β)(γ)(α = β .⊃: γ ∈ α .⊃. γ ∈ β)⌝

are actually redundant assumptions; for they are readily seen to
be theorems on the basis of *103 and *100. In consequence it can
be shown that (3) would suffice as sole substitutivity axiom, if we
were disposed to disregard non-logical atomic matrices and provide
for only such theorems as are logical formulae. But the notion
of theorem is intended as an approximation to the general notion
of logical truth; and it seems desirable, as in Chapter II, to pursue
our codification of this notion in such a way that it will apply to
statements whose atomic formulae have any sort of subject matter,
logical or otherwise. Accordingly the more generous set of substitutivity
axioms is adopted. The redundant cases described in
(5) are admitted along with the rest merely because an added
clause expressly excluding them would be an idle complication.

The substitutivity axioms are appropriately classifiable as
further axioms of membership, supplementary to those of § 28;
for the substitutivity of identity appears as a condition on ‘∈’
when ‘=’ is expanded by D10. We turn next to the consideration
of yet a third set of axioms of membership.

When we found that the common-sense idiom ‘the class of all
entities x such that . . . x . . .’ could not in general be trusted to
express a class, we weakened it to read ‘the class of all elements x
such that . . . x . . .’; symbolically ‘x̂(. . . x . . .)’. We now need
axioms of membership providing in general for the existence of the
class described by this weakened idiom. There is to be a class
y whose members are all and only those elements x such that
. . . x . . .; symbolically

(6) (∃y)(x)(x ∈ y .≡. x ∈ V . . . . x . . . ),

since ‘x ∈ V’ means that x is an element. All statements of the
kind (6), then, and the closures of all matrices of this kind, are to
be axioms of membership. More rigorously stated, the axioms of
membership are to include the closures of all formulae

(7) ⌜(∃β)(α)(α ∈ β .≡. α ∈ V . φ)⌝

such that β is not α nor free in φ.
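It may help to see how the clause ‘x ∈ V’ in (6) behaves in the troublesome case where ‘x ∉ x’ is put for ‘. . . x . . .’. The following is a sketch of the informal reasoning in modern LaTeX notation (using `amsmath`), not a proof in the book's format; y is the class whose existence the instance of (6) affirms.

```latex
\begin{align*}
&(\exists y)(x)(x \in y \;\equiv\; x \in V \,.\, x \notin x)
  && \text{instance of (6)} \\
&y \in y \;\equiv\; y \in V \,.\, y \notin y
  && \text{taking } y \text{ for } x \\
&y \notin y
  && \text{else } y \in y \text{ and } y \notin y \\
&y \notin V
  && \text{else } y \in y \text{ by the second line}
\end{align*}
```

So the class exists but is not an element, in agreement with what was said of x̂(x ∉ x) in connection with (1) of § 28; no contradiction results.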

To sum up, then, the axioms of membership comprise the closures
of the formulae described in connection with (6) of § 28 and
(2) and (7) of the present section. The notation ‘⊢ φ’ will be used
hereafter to mean that the closure of φ is a theorem in our newly
extended sense: an axiom of quantification or membership or a
statement derivable from such axioms by one or more steps of
modus ponens. The totality of theorems ceases to be determined
by *100-*104 alone, and comes rather to be determined by *100-*104
plus the following:

*200. If φ has no free variables beyond α, β₁, β₂, . . . , βₙ, and is formed
from a stratified formula by restricting all bound variables to elements,
⊢ ⌜β₁, β₂, . . . , βₙ ∈ V .⊃. α̂φ ∈ V⌝ (⊢ ⌜α̂φ ∈ V⌝ when n = 0).

*201. If φ is atomic, and φ' is formed from φ by putting α' for an
occurrence of α, then ⊢ ⌜α = α' .⊃. φ ⊃ φ'⌝.

*202. If β is not α nor free in φ,

⊢ ⌜(∃β)(α)(α ∈ β .≡. α ∈ V . φ)⌝.

What have hitherto been shown to be theorems continue to be
theorems; but we can now establish further theorems, with help of
*200-*202, which were lacking before. Two simple ones are the
following, which affirm that V and Λ are elements.

†210. V ∈ V

Proof. ‘(y)(y ∈ x .≡. y ∈ x)’ is stratified; hence
*200                      x̂(y)(y ∈ V .⊃: y ∈ x .≡. y ∈ x) ∈ V    (1)
*100, *123 (& D15, 10)    [1 ≡] 210

†211. Λ ∈ V    Proof similar, with ‘~(y)’ in place of ‘(y)’.

With help of †210 and *202 it is now possible to prove a theorem
to the effect that, whatever y may be, there is something else;
in other words, that there is more than one entity.


†212. (y)(∃x)(x ≠ y)

Proof.
*100            z = V . z ∈ V .⊃ ~(z ∈ x .≡. z ∈ V . z ∉ x)       (1)
*149 (& D12)    [(z)1 ⊃. 210 ⊃] (∃z)~(z ∈ x .≡. z ∈ V . z ∉ x)    (2)
*130            ~(z)(z ∈ x .≡. z ∈ V . z ∉ x) [≡ 2]               (3)
*121 (& D10)    x = y .⊃. [3 ≡] ~(z)(z ∈ x .≡. z ∈ V . z ∉ y)     (4)
*100            [4 ⊃:] (z)(z ∈ x .≡. z ∈ V . z ∉ y) ⊃. x ≠ y      (5)
*202            (∃x)(z)(z ∈ x .≡. z ∈ V . z ∉ y)                  (6)
*149            [(x)5 ⊃. 6 ⊃] 212

Ways of protecting logic from contradictions of the sort considered at the 
beginning of § 24 were proposed in 1908 by both Russell and Zermelo. Russell’s 
way, known as the theory of types, is restated in Whitehead and Russell, vol. 1, pp. 
37-65. The theory is highly complex and at some points obscure. Some of its 
complexity was resolved in 1914, when Wiener reduced relations to classes (cf. § 36). 
The dispensability of the most complex and obscure part of the theory, viz. the
part having to do with the so-called orders of propositions and propositional functions,
was pointed out by Chwistek in 1921. (See Church’s review, also Ramsey,
pp. 20-29, and my “Axiom of Reducibility.”) Thus simplified, the theory of types
is substantially this: each entity is conceived as belonging to one and only one of a
hierarchy of so-called types; and any formula which represents membership as
holding between entities of other than consecutive ascending types is rejected as
meaningless, along with all its contexts. (Cf. my “Theory of Types.”) In particular,
thus, ‘x ∈ x’ and all its contexts are meaningless; and similarly for ‘x ∈ y .
y ∈ x’, ‘x ∈ y . y ∈ z . z ∈ x’, etc. The prefix ‘the class of all entities x such that’
survives, but all the matrices in connection with which this prefix caused trouble are
now banished from the language.
Under Russell’s theory there is no one exhaustive class V, but rather an infinite
array of analogues each of which exhausts a type; for a class is forbidden to draw
its members from more than one type. In similar fashion every other logically
definable entity, e.g. Λ, the identity function I (§ 42), and indeed each number,
loses its unity and gives way to infinitely many replicas. Intuitively all these 
cleavages and reduplications are of course unwelcome; and technically also, for 
they call continually for more or less elaborate technical manoeuvres by way of 
restoring severed connections. 

Zermelo’s scheme, refined in 1930 by Skolem, is in effect as follows. Special
axioms are adopted providing for the existence of the null class, the class of all subclasses
of a given class, the class of all members of members of a given class, and the
class having any one or two given things as sole members. Then a so-called
Aussonderungsaxiom is added which allows generation of further classes via the
idiom ‘the class of all entities x such that x ∈ y and . . . x . . .’. The Aussonderungsaxiom
is the precursor of *202, with which it is identical except for exhibiting a new
variable in place of V. For variants of the Zermelo system see Fraenkel’s Einleitung,
my “Set-Theoretic Foundations,” and Ackermann’s “Mengentheoretische
Begründung.”

Zermelo’s theory is free from the drawbacks noted in connection with Russell’s, 
but has drawbacks of its own. Whereas under Russell’s scheme an abstraction 
prefix ‘the class of all entities x (of given type) such that’ can be applied outright 
to any meaningful formula, under Zermelo’s scheme generation of classes tends to 
be laborious and uncertain. Moreover, there is for Zermelo no class which embraces
more than an infinitesimal proportion of the totality of entities; thus there
is nothing remotely analogous to an exhaustive class, nor does any class have a
complement x̄ (cf. § 33).

My “New Foundations” strikes a middle course between Russell and Zermelo.
Zermelo’s initial axioms of class existence are dropped, and the idiom ‘the class of
all entities x such that’ is construed as effective whenever applied to a stratified
formula. There is an echo here of Russell, since a stratified formula is one which
could be reconciled with the theory of types by some assignment of types to the
variables; but types themselves, and the cleavages and reduplications which they
involve, are abandoned. Unstratified formulae remain meaningful, and can often
even be shown by more or less devious reasoning to determine classes; just as, in
the system of this book, unstratified formulae can often be shown to determine
elements. In contrast to Zermelo’s theory, “New Foundations” countenances an
exhaustive class V and a complement x̄ of every class x. Instead of repudiating the
most inclusive classes, the system countenances a universe which is symmetrical as
between small and large, as the existence of complements testifies. Within the
lower half of this symmetrical universe, a universe fulfilling Zermelo’s theory is
contained (provided that two rather appendical axioms of Zermelo, viz. the axiom
of infinity and the axiom of choice, be appended also to “New Foundations” or else
dropped altogether).

In “New Foundations,” unlike Zermelo’s system, generation of classes is as
convenient as in Russell’s theory; for the prefix ‘the class of all entities x such that’
is now available in connection with formulae corresponding to all Russell’s meaningful
formulae. Nor is the phrase ‘all entities x’ here subject, as with Russell, to the
parenthetical qualification ‘of given type’. Moreover, the resulting abstracts may
be substituted for variables at will without fear, as in Russell’s system, of rendering
the context meaningless. “New Foundations” shares with Zermelo’s system the
virtue of involving no complications in the matter of meaningfulness of formulae.

The system of theorems determined by *100-*104 and *200-*202 could be converted
into the system of “New Foundations” merely by dropping *200, deleting
⌜α ∈ V .⌝ from *202, and stipulating that the φ of *202 be stratified.

There is, however, a technical inconvenience to which “New Foundations” and
Zermelo’s system are alike subject, and from which Russell’s scheme is free; namely,
the survival of conditions (e.g. ‘x ∉ x’) to which no classes correspond. This
divergence from Russell is a matter of liberalization rather than restriction, of
course, for it consists in the retention of formulae which Russell rejected as meaningless;
but it nevertheless proves awkward in some cases. One unnatural result is
that the principle of mathematical induction (§ 44) cannot be established in its
general form unless we adopt special axioms for the purpose; cf. Rosser, “Consistency,”
“Definition.”

The system set forth by von Neumann (1926), and recast by Bernays (1937) 
in a form more directly comparable with tradition, overcomes this kind of difficulty. 
Matrices which would otherwise have no classes corresponding to them are now 
accorded classes of a special sort, incapable of being members. This proscription 
of membership seems to work as well as the repudiation of the classes themselves 
so far as the avoidance of contradictions is concerned, and it is a milder and technically
much more convenient sort of restriction.

The basic difference between the von Neumann-Bernays system and the system
developed in this book resides in the conditions of elementhood; whereas the elements
of the von Neumann-Bernays system comprise approximately the classes of
Zermelo, the elements of the present system comprise all the classes of “New
Foundations.” For von Neumann and Bernays, no element embraces as members
more than an infinitesimal proportion of the totality of elements; consequently the
class V of all elements is not an element, nor is the complement x̄ of any element x.
For the present system on the other hand V is an element (†210) and so is x̄ for
each element x (†274); the totality of elements is symmetrical as between small
and large.

While the elements of the present system answer to the classes of “New Foundations,”
it is not clear that von Neumann’s elements correspond exactly to Zermelo’s
classes. Thus, whereas the lower half of the symmetrical universe of “New Foundations”
contains Zermelo’s universe, it is not altogether clear that the lower half
of our present symmetrical totality of elements contains all of von Neumann’s
elements. This relationship could be assured by supplementing *200-*202 with a
further axiom of membership:

<*) 


*«V.V (3y)(* Cty. y“x = V) 
(in the notation of §§ 38, 40), which answers to half of von Neumann’s 4.2; but let
us beware, for perhaps this axiom is actually refutable on the basis of *100-*104 
and *200-*202. 

Whereas the present system would appear to be much more liberal than von 
Neumann’s in the matter of elementhood, von Neumann’s distinguishes itself on an 
important point of economy; for his system demands, over and above *100-*104, 
only a finite number of axioms. 

In the present system we regain much of the manipulative convenience which 
was enjoyed in the days of false security before 1902 and sacrificed in the systems 
of Russell, Zermelo, and “New Foundations.” We can compose formulae in the 
simple manner of § 23 without fear of meaninglessness, we can apply abstraction 
prefixes to any of these at will, and we can substitute the results for variables 
(*231) without thought of stratification or auxiliary existence theorems. We pay 
for this in clauses of elementhood ⌜ζ ∈ V⌝, but at bargain rates.

The contradictions which instigated this whole series of researches, from Russell 
and Zermelo onward, were implicit in the inferential methods of uncritical common 
sense; and the various reformulations of logic which have been proposed for 
avoiding the contradictions have been correspondingly artificial and foreign to 
common sense. The least artificial and at the same time the technically most 
convenient formulation would seem to be that which comes as close as it can to the 
over-liberal canons of common sense without restoring the contradictions. But 
the more closely we approach this ideal in point of liberality, the more risk we run 
of subtly reinstating a contradiction for posterity to discover. If we undertake to 
prove consistency, moreover, we find ourselves in this curious predicament: the 
proof would itself have to proceed by logic, and hence would be conclusive only in 
proportion to our prior confidence in the consistency of the logic used. The most 
we can hope for, in such a proof, is to show that one theory is consistent if another 
is; and this is interesting just in case the one theory was more suspect than the 
other. Intuitive obviousness thus becomes the last arbiter — and a fallible one, 
in view of the contradiction which Russell was able to draw from common-sense 
logic. But there are degrees of obviousness; future analysis may enable us to 
derive logic from a set of principles yet more obvious and natural than those which 
Russell discredited. Meanwhile we adopt a smooth-running technique which does 
not appear to be inconsistent. Our risk of inconsistency is, at any rate, no greater 
here than in “New Foundations”; for Wang has shown that the one system is 
consistent if the other is. 
§ 30. Substitutivity of Identity

THE SUBSTITUTIVITY of identity, which has been postulated
in a rudimentary form (*201), will now be established in its full generality.
Seven metatheorems *221-*227 will be proved which are
exactly analogous to the principles *120-*123 and *186-*188 of the
substitutivity of the biconditional. As a lemma for *221 it is
convenient first to prove the following special case thereof (which
is at the same time a case of *223).

*220. If φ' is formed by putting ζ' for an occurrence of ζ in a quasi-atomic
context φ, then ⊢ ⌜ζ = ζ' .⊃. φ ≡ φ'⌝.

Proof. Case 1: ζ and ζ' are variables.

Case 1a: φ is atomic.¹

*201 (& hp)           ⊢ ⌜ζ = ζ' .⊃. φ ⊃ φ'⌝.     (1)
*100, *123 (& D10)    ⊢ ⌜ζ = ζ' .⊃. ζ' = ζ
*201 (& hp)                     .⊃. φ' ⊃ φ⌝.     (2)
*100 (& D5)           ⊢ ⌜[1 . 2 .⊃] 220⌝.

Case 1b: φ is not atomic. Then the expansion of φ into primitive
notation makes the occurrence of the variable ζ give way to many
occurrences, because of the reduplications arising from expanding
‘∈’ and ‘=’ into terms of ‘↓’; but these occurrences
will have a uniform atomic context ψ, which occurs over and over.
Let ψ' be the corresponding part of φ'. By Case 1a,

⊢ ⌜ζ = ζ' .⊃. ψ ≡ ψ'⌝.     (1)

From the fact that φ and φ' are quasi-atomic contexts of ζ and ζ',
it is clear that the specified occurrences of ψ and ψ' in the primitive
expansions of φ and φ' will not fall within any quantifications
governed by ⌜(ζ)⌝ or ⌜(ζ')⌝; in other words, ζ and ζ' are distinct
from all the variables β₁, . . . , βₖ with respect to which the occurrences
of ψ and ψ' in question are bound in φ and φ'. Hence

(1), *117    ⊢ ⌜ζ = ζ' .⊃ (β₁) . . . (βₖ)(ψ ≡ ψ')
*121                  ⊃. φ ≡ φ'⌝.

¹ From the fact that ζ is a variable we cannot conclude that its quasi-atomic
context φ is atomic; abstracts may still turn up at other points in φ. But φ may
be atomic; the atomic is a special case of the quasi-atomic.

Case 2: ζ and ζ' are abstracts.

Case 2a: φ and φ' are ⌜α ∈ ζ⌝ and ⌜α ∈ ζ'⌝. *103 (& D10).

Case 2b: φ and φ' are of other forms than ⌜α ∈ ζ⌝ and ⌜α ∈ ζ'⌝.
Still, definitional expansion of φ and φ' reveals that the occurrences
of ζ and ζ' in question, in φ and φ', are free occurrences having the
immediate contexts ⌜α ∈ ζ⌝ and ⌜α ∈ ζ'⌝ where α is foreign to ζ and
ζ' (cf. § 26). Thus

*121    ⊢ ⌜(α)(α ∈ ζ .≡. α ∈ ζ') ⊃. φ ≡ φ'⌝, q.e.d. (cf. D10).

Case 3: ζ is a variable and ζ' an abstract.

Case 3a: φ and φ' are ⌜α ∈ ζ⌝ and ⌜α ∈ ζ'⌝. *103 (& D10).

Case 3b: φ and φ' are ⌜α̂ψ ∈ ζ⌝ and ⌜α̂ψ ∈ ζ'⌝. Let γ be new.

*121    ⊢ ⌜(γ)(γ ∈ ζ .≡. γ ∈ ζ') ⊃.
            (∃γ)(γ = α̂ψ . γ ∈ ζ) ≡ (∃γ)(γ = α̂ψ . γ ∈ ζ')⌝,
q.e.d. (cf. D10, 12).

Case 3c: φ and φ' are ⌜ζ ∈ η⌝ and ⌜ζ' ∈ η⌝. Let γ be new.

*134               ⊢ ⌜ζ = ζ' . ζ ∈ η .⊃ (∃γ)(γ = ζ' . γ ∈ η)⌝.      (1)
*100               ⊢ ⌜[1 ≡] ζ = ζ' .⊃: ζ ∈ η .⊃ R1⌝.                (2)
*220 (Case 1)      ⊢ ⌜γ = ζ .⊃. γ ∈ η .≡. ζ ∈ η⌝.                   (3)
*100, *163         ⊢ ⌜(∃γ)([3 ⊃.] γ = ζ . γ ∈ η) ⊃. ζ ∈ η⌝.         (4)
*121 (& D10)       ⊢ ⌜ζ = ζ' .⊃:. [4 ≡:] R1 ⊃. ζ ∈ η⌝.              (5)
*100 (& D5, 12)    ⊢ ⌜[2 . 5 .⊃:.] ζ = ζ' .⊃: ζ ∈ η .≡. ζ' ∈ η⌝, q.e.d.

Case 3d: φ and φ' are extra-logical quasi-atomic contexts, as of
Dar~a„ of § 26. Proof similar. 

Case 4: ζ is an abstract and ζ' a variable.

*100, *123 (& D10)    ⊢ ⌜ζ = ζ' .⊃. ζ' = ζ
*220 (Case 3)                   ⊃. φ' ≡ φ
*100                            ⊃. φ ≡ φ'⌝.

*221. If φ' is like φ except for containing ζ' at a place where φ contains
ζ, and α₁, . . . , αₙ exhaust the variables with respect to which
those occurrences of ζ and ζ' are bound in φ and φ', then

⊢ ⌜(α₁) . . . (αₙ)(ζ = ζ') ⊃. φ ≡ φ'⌝.

Proof. The occurrences of ζ and ζ' in question have as immediate
contexts either quasi-atomic¹ contexts ψ and ψ' (Case 1) or else
quantifiers ⌜(ζ)⌝ and ⌜(ζ')⌝ (Case 2). In Case 1, ζ and ζ' may be
abstracts or variables; in Case 2 they must be variables. Case 2
has little practical significance, but must be dealt with because
*221, taken literally, includes it.

Case 1.
*114    ⊢ ⌜(α₁) . . . (αₙ)(ζ = ζ') ⊃. ζ = ζ'
*220              ⊃. ψ ≡ ψ'⌝.     (1)

Let ⌜(β₁)χ₁⌝, ⌜(β₂)χ₂⌝, . . . , ⌜(βₘ)χₘ⌝ (m ≥ 0) be all the quantifications
in φ which contain the given occurrence of ζ, and let
⌜(β₁)χ₁'⌝, . . . , ⌜(βₘ)χₘ'⌝ be the corresponding parts of φ'. Then,
if βᵢ is a free variable of ζ (or ζ'), the occurrence of ζ (or ζ') will be
bound in φ (or φ') with respect to βᵢ; wherefore, by hp 2, βᵢ will
be one of α₁, . . . , αₙ. For each i from 1 to m, therefore, βᵢ is one of
α₁, . . . , αₙ or else not free in ζ nor in ζ'. Thus none of β₁, . . . , βₘ
is free in ⌜(α₁) . . . (αₙ)(ζ = ζ')⌝. Hence

(1), *117    ⊢ ⌜(α₁) . . . (αₙ)(ζ = ζ') ⊃ (β₁) . . . (βₘ)(ψ ≡ ψ')
*121                   ⊃. φ ≡ φ'⌝.

Case 2 (see above). If the variables £ and £' are the same then 
<f>' is <f>, whereat r 221 1 by *100; if on the other hand they are 
distinct, 


*171 |-T(y)212 =](£')(3£)(£ £')L (1) 

*110 HTi 3 ]( 3 f)(f ^ n n . ( 2 ) 

*130 h r ~ (£)(£ = rot- 2] 1 - (3) 

*ti 4 h r («o ...(«»)(£ = ni-t-r. (4) 


The occurrence of $\zeta$ in $\phi$ which we are considering is bound to the occurrence of $\ulcorner(\zeta)\urcorner$ which contains it (cf. § 14), an occurrence of $\ulcorner(\zeta)\urcorner$ which is in $\phi$ and not, of course, in $\zeta$. Thus the occurrence of $\zeta$ is bound in $\phi$ with respect to $\zeta$, wherefore, by hp 2, $\zeta$ is one of $\alpha_1, \ldots, \alpha_n$. Thus

(4), *117 [- r (cn) .. . («„)(£ = £')D (£)(£ - £T 
*100 3:[3D.> = <t> n . 

1 Possibly atomic, in case $\zeta$ and $\zeta'$ are variables. See preceding footnote.
*222. If $\phi'$ is like $\phi$ except for containing $\zeta'$ at some places where $\phi$ contains $\zeta$, and $\alpha_1, \ldots, \alpha_n$ exhaust the variables with respect to which those occurrences of $\zeta$ and $\zeta'$ are bound in $\phi$ and $\phi'$, then

$\vdash \ulcorner(\alpha_1) \ldots (\alpha_n)(\zeta = \zeta') \supset.\ \phi \equiv \phi'\urcorner$.

Proof from *221 like that of *121 from *120.

*223. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta'$ in place of some free occurrences of $\zeta$, then $\vdash \ulcorner\zeta = \zeta' .\supset.\ \phi \equiv \phi'\urcorner$.

Proof: *222 (n = 0).

*224. If $\vdash \ulcorner\zeta = \zeta'\urcorner$ and $\phi'$ is formed from $\phi$ by putting $\zeta'$ for some occurrences of $\zeta$, then $\vdash \ulcorner\phi \equiv \phi'\urcorner$.

Proof from *222 like that of *123 from *121.

*225. If $\eta'$ is like $\eta$ except for containing $\zeta'$ at some places where $\eta$ contains $\zeta$, and $\alpha_1, \ldots, \alpha_n$ exhaust the variables with respect to which those occurrences of $\zeta$ and $\zeta'$ are bound in $\eta$ and $\eta'$, then

$\vdash \ulcorner(\alpha_1) \ldots (\alpha_n)(\zeta = \zeta') \supset.\ \eta = \eta'\urcorner$.

Proof: *222, *117 (& D10).

*226. If $\eta'$ is like $\eta$ except for containing free occurrences of $\zeta'$ in place of some free occurrences of $\zeta$, then $\vdash \ulcorner\zeta = \zeta' .\supset.\ \eta = \eta'\urcorner$.

Proof: *225 (n = 0).

*227. If $\vdash \ulcorner\zeta = \zeta'\urcorner$ and $\eta'$ is formed from $\eta$ by putting $\zeta'$ for some occurrences of $\zeta$, then $\vdash \ulcorner\eta = \eta'\urcorner$.

Proof from *225 like that of *123 from *121.

The following variant of *223 is occasionally useful.

*228. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta'$ in place of some free occurrences of $\zeta$, then

$\vdash \ulcorner\zeta = \zeta' \,.\, \phi \,.\supset \phi'\urcorner$.

Proof. *100   $\vdash \ulcorner[223 \supset]\ 228\urcorner$.

§ 31. Substitution for Variables

SINCE $\hat{x}(\ldots x \ldots)$ is the class of all elements $x$ such that $\ldots x \ldots$, any entity $x$ will belong to the class if and only if it is an element and $\ldots x \ldots$. That the closures of such biconditionals are theorems is established in the following metatheorem, which will subsequently be generalized (*235).

*230. $\vdash \ulcorner\alpha \epsilon \hat{\alpha}\phi \,.\equiv.\, \alpha \epsilon V \,.\, \phi\urcorner$.


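Proof. Let $\beta$ be new.

*171   $\vdash \ulcorner[(x)(y)190 \equiv]\ (\alpha)(\beta)(\alpha \epsilon \beta .\supset.\ \alpha \epsilon V)\urcorner$.   (1)
*114   $\vdash \ulcorner[1 \supset:]\ \alpha \epsilon \beta .\supset.\ \alpha \epsilon V\urcorner$.   (2)
*110   $\vdash \ulcorner(\alpha)(\alpha \epsilon \beta .\supset \phi) \supset:\ \alpha \epsilon \beta .\supset \phi$
*100   $\supset:\ [2\ .]\ \alpha \epsilon \beta .\supset.\ \alpha \epsilon V \,.\, \phi\urcorner$.   (3)
*100, *163   $\vdash \ulcorner(\exists\beta)([3\ .]\ \alpha \epsilon \beta \,.\, (\alpha)(\alpha \epsilon \beta .\supset \phi)) \supset.\ \alpha \epsilon V \,.\, \phi\urcorner$.   (4)
*100   $\vdash \ulcorner\alpha \epsilon V \,.\, \phi \,.\supset \phi\urcorner$.   (5)
*135 (& D9)   $\vdash \ulcorner\alpha \epsilon \beta \,.\, (\alpha)(\alpha \epsilon \beta .\supset \phi) \,.\supset.\ \alpha \epsilon \hat{\alpha}\phi\urcorner$.   (6)
*121   $\vdash \ulcorner(\alpha)(\alpha \epsilon \beta .\equiv.\ \alpha \epsilon V \,.\, \phi) \supset:.\ [6 \equiv:]\ \alpha \epsilon V \,.\, \phi\ [.\ (\alpha)5]\ .\supset.\ \alpha \epsilon \hat{\alpha}\phi\urcorner$.   (7)
(7), *163   $\vdash \ulcorner[202 \supset:]\ \alpha \epsilon V \,.\, \phi \,.\supset.\ \alpha \epsilon \hat{\alpha}\phi\urcorner$.   (8)
*100 (& D9)   $\vdash \ulcorner[4 \,.\, 8 \,.\supset]\ 230\urcorner$.
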



Passing mention was made earlier (§ 15) of logical truths of the form $\ulcorner(\alpha)\phi \supset \phi'\urcorner$, where $\phi'$ is like $\phi$ except for containing a name $\zeta$ in place of the free occurrences of $\alpha$. The notion of name has since been given an explicit status (§ 27), and now those logical truths are forthcoming as theorems. This is established in the following metatheorem, which is an extension of *103.

*231. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta$ wherever $\phi$ contains free occurrences of $\alpha$, then $\vdash \ulcorner(\alpha)\phi \supset \phi'\urcorner$.


Proof. Case 1: £ is a variable. *103. 

Case 2. £ is an abstract l px~ l . Let a' be new, and let p' be 
formed from p by putting a' for the free occurrences of a. 


*223 (& hp) 

b r<x ' = fix -3. P' = P 1 


*122 

3: Q03 =.](a)</> 3 p 1 . 

(1) 

*202 


(2) 

*230, *123 

b r (3«')((3)(0<«'.=.|Mx)[ = 2]L 

(3) 

*171 (& D10) 

b r [3 =](3 =$ X r. 

(4) 

(1), *163 

b r [4 3] 231" 1 ! 



This metatheorem is used constantly in the sequel. It embodies the principle of application or specification: the principle which leads from a general law, a universal quantification $\ulcorner(\alpha)\phi\urcorner$, to each special case $\phi'$ falling under the general law. We get $\phi'$ from $\phi$ by substituting a name or name matrix $\zeta$ for the variable of quantification $\alpha$. It is important to note that the substitution must exhaust the free occurrences of $\alpha$ in $\phi$; thus *231 gives us:

$(x)(x = x) \supset.\ \Lambda = \Lambda$

and: $(z) \quad (x)(x = x) \supset.\ \hat{y}(z \epsilon y) = \hat{y}(z \epsilon y)$

as theorems, but not:

$(x) \quad (x)(x = x) \supset.\ x = \Lambda$.

The latter is clearly false. In thus requiring exhaustive substitution, *231 resembles *103 and *134 and differs from *120-*124, *186-*188, *220-*228.

The following metatheorem extends *134 as *231 extends *103. 

*232. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta$ wherever $\phi$ contains free occurrences of $\alpha$, then $\vdash \ulcorner\phi' \supset (\exists\alpha)\phi\urcorner$.

Proof. *231 (& hp) [- r [(a)135 3] 232 1 . 

The next extends *231 to the case of many quantifiers. 

*233. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta_1, \zeta_2, \ldots, \zeta_n$ in place respectively of all free occurrences of $\alpha_1, \alpha_2, \ldots, \alpha_n$, then

$\vdash \ulcorner(\alpha_1)(\alpha_2) \ldots (\alpha_n)\phi \supset \phi'\urcorner$.

Proof. We may assume without loss of generality that $\alpha_1, \alpha_2, \ldots, \alpha_n$ are distinct; for if $\alpha_i$ were $\alpha_j$ then the hp would demand that $\zeta_i$ be $\zeta_j$, so that the extra mention of $\alpha_j$ and $\zeta_j$ could simply be dropped. Now let $\psi_0$ be formed from $\phi$ by putting new distinct variables $\beta_1, \ldots, \beta_n$ for all free occurrences respectively of $\alpha_1, \ldots, \alpha_n$; and, for each $i$ from 1 to $n$, let $\psi_i$ be formed from $\psi_{i-1}$ by putting $\zeta_i$ for $\beta_i$.

*171   $\vdash \ulcorner(\alpha_1) \ldots (\alpha_n)\phi \supset (\beta_1) \ldots (\beta_n)\psi_0$
*231   $\supset (\beta_2) \ldots (\beta_n)\psi_1$
*231   $\supset (\beta_3) \ldots (\beta_n)\psi_2$
  ...
*231   $\supset \psi_n\urcorner$.

But $\psi_n$ is $\phi'$.
The reason for introducing new variables $\beta_1, \ldots, \beta_n$, in the above proof, is as follows. If $\psi_1$ were formed rather by putting $\zeta_1$ for $\alpha_1$ in $\phi$, and $\psi_2$ by putting $\zeta_2$ for $\alpha_2$ in $\psi_1$, and so on, then the assertion:

$\vdash \ulcorner(\alpha_1) \ldots (\alpha_n)\phi \supset (\alpha_2) \ldots (\alpha_n)\psi_1\urcorner$

would not be justified by *231 in case any of $\alpha_2, \ldots, \alpha_n$ happened to be free in $\zeta_1$; and correspondingly for each succeeding step of the proof. The $\beta$'s obviate this difficulty, foreign as they are to the $\zeta$'s.

Note that *233 does not demand a distinct $\zeta$ for every distinct $\alpha$. All of the following, e.g., are theorems according to *233:

(s)(w) (*)(y)(# e z .V. z e y) 3 ", w e z .V. z e V, 

(z)(w) (x)(y)(x e z .V. z t y) 3 : w « z .V. z e w, 

(z) ( x)(y)(x e z .V. z t y) 3: V e z .V. z e V, 

(z) (x) (y)(x e z .V. z t y) 3 : z e z .V. z c z. 

Hereafter, when a daggered numeral is used to justify a step of proof, a tacit inference according to *233 will commonly be involved. The following line, e.g., might typically occur in a proof:

†190   $y \epsilon \hat{x}(z \epsilon x) .\supset.\ y \epsilon V$.

Rendered in full, the reasoning here is as follows:

*233   $[(x)(y)190 \supset:]\ y \epsilon \hat{x}(z \epsilon x) .\supset.\ y \epsilon V$.

Similarly the first line in the proof of *234, below, amounts to this:

*233   $\vdash \ulcorner[(x)182 \supset.]\ \zeta = \zeta\urcorner$.

In effect, thus, the matrix following the segregated initial quantifiers of a theorem (the matrix ‘$x \epsilon y .\supset.\ x \epsilon V$’ in the case of †190, or ‘$x = x$’ in the case of †182) is tacitly reconstrued in proofs as containing free occurrences of any desired terms in place of the free variables. Under this procedure the first two lines in the proof of *230 would have given way to the single line:

†190   $\vdash \ulcorner\alpha \epsilon \beta .\supset.\ \alpha \epsilon V\urcorner$.

Note, however, that this innovation relates only to numerals with daggers attached. Divorced of its dagger, the numeral continues to serve as an abbreviation of just the one specific formula which actually appears after the segregated initial quantifiers (if any) in the original statement of the theorem; the undaggered numeral ‘190’, e.g., serves in proofs as an abbreviation of the matrix ‘$x \epsilon y .\supset.\ x \epsilon V$’ and this only.

The following metatheorem depends both upon substitution for variables, as of *231 and *232, and upon the substitutivity of identity as of *223.

*234. If $\alpha$ is not free in $\zeta$, and $\psi$ is like $\phi$ except for containing free occurrences of $\zeta$ wherever $\phi$ contains free occurrences of $\alpha$, then (a) $\vdash \ulcorner\psi \equiv (\alpha)(\alpha = \zeta .\supset \phi)\urcorner$ and (b) $\vdash \ulcorner\psi \equiv (\exists\alpha)(\alpha = \zeta \,.\, \phi)\urcorner$.


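Proof.
†182   $\vdash \ulcorner\zeta = \zeta\urcorner$.   (1)
*231 (& hp)   $\vdash \ulcorner(\alpha)(\alpha = \zeta .\supset \phi) \supset.\ [1 \supset]\ \psi\urcorner$.   (2)
*232 (& hp)   $\vdash \ulcorner[1\ .]\ \psi \supset (\exists\alpha)(\alpha = \zeta \,.\, \phi)\urcorner$.   (3)
*223 (& hp 2)   $\vdash \ulcorner\alpha = \zeta .\supset.\ \phi \equiv \psi\urcorner$.   (4)
*100, *117 (& hp)   $\vdash \ulcorner\psi \supset (\alpha)([4 \,.\supset:]\ \alpha = \zeta .\supset \phi)\urcorner$.   (5)
*100, *163 (& hp)   $\vdash \ulcorner(\exists\alpha)([4 \,.\supset:]\ \alpha = \zeta \,.\, \phi) \supset \psi\urcorner$.   (6)
*100   $\vdash \ulcorner[5 \,.\, 2 \,.\supset]\ 234a\urcorner$.
*100   $\vdash \ulcorner[3 \,.\, 6 \,.\supset]\ 234b\urcorner$.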


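One of the theorems covered by *234b has already been considered separately, in †185.

The stipulation that $\alpha$ not be free in $\zeta$ is clearly essential to *234; otherwise we might take $\zeta$ as $\alpha$ itself, e.g., thus concluding that

$\vdash \ulcorner\phi \equiv (\alpha)(\alpha = \alpha .\supset \phi)\urcorner$ and $\vdash \ulcorner\phi \equiv (\exists\alpha)(\alpha = \alpha \,.\, \phi)\urcorner$

and hence that $\vdash \ulcorner\phi \equiv (\alpha)\phi\urcorner$ and $\vdash \ulcorner\phi \equiv (\exists\alpha)\phi\urcorner$ for all choices of $\phi$ and $\alpha$.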

The principle noted at the beginning of the section, in *230, now achieves its general formulation with help of *231.

*235. If $\phi'$ is like $\phi$ except for containing free occurrences of $\zeta$ wherever $\phi$ contains free occurrences of $\alpha$, then

$\vdash \ulcorner\zeta \epsilon \hat{\alpha}\phi \,.\equiv.\, \zeta \epsilon V \,.\, \phi'\urcorner$.

Proof. *231 (& hp)   $\vdash \ulcorner[(\alpha)230 \supset]\ 235\urcorner$.

This principle provides a very useful way of paraphrasing $\ulcorner\zeta \epsilon \hat{\alpha}\phi\urcorner$ so as to eliminate the abstract; a simpler and more convenient way, indeed, than that provided by D9 and D12 themselves. But *235, unlike the definitions, is not always immediately applicable; transformation of $\ulcorner\zeta \epsilon \hat{\alpha}\phi\urcorner$ into an alphabetic variant is sometimes necessary in preparation. E.g., the formula:

(1)   $y \epsilon \hat{x}(\exists y)(y \epsilon x)$

cannot, as it stands, be paraphrased by *235; for it is impossible to construct a formula $\phi'$ which is like ‘$(\exists y)(y \epsilon x)$’ except for containing free occurrences of ‘$y$’ wherever ‘$(\exists y)(y \epsilon x)$’ contains free occurrences of ‘$x$’. The alphabetic variant:

$y \epsilon \hat{x}(\exists z)(z \epsilon x)$

of (1), on the other hand, can of course be paraphrased by *235; it becomes:

$y \epsilon V \,.\, (\exists z)(z \epsilon y)$.
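The obstacle in example (1) is what is now usually called variable capture, and the passage to an alphabetic variant can be mimicked mechanically. The following is a minimal sketch, assuming a toy tuple encoding of formulas that is not the book's notation; the function names `free_vars`, `all_vars`, `substitute`, and `substitute_via_variant` are hypothetical and chosen only for illustration.

```python
from itertools import count

# Assumed toy encoding (not the book's notation):
#   ("in", a, b)         stands for  a ϵ b, with a and b variable names
#   ("exists", v, body)  stands for  (∃v) body

def free_vars(f):
    """Variables having free occurrences in f."""
    if f[0] == "in":
        return {f[1], f[2]}
    _, v, body = f
    return free_vars(body) - {v}

def all_vars(f):
    """Every variable occurring in f, free or bound."""
    if f[0] == "in":
        return {f[1], f[2]}
    _, v, body = f
    return all_vars(body) | {v}

def substitute(f, var, new):
    """Put `new` for the free occurrences of `var` in f.

    Raises ValueError when an inserted `new` would fall under a
    quantifier on `new`, i.e. when no suitable formula exists as it stands.
    """
    if f[0] == "in":
        return ("in",
                new if f[1] == var else f[1],
                new if f[2] == var else f[2])
    _, v, body = f
    if v == var:                      # `var` is not free inside this quantification
        return f
    if v == new and var in free_vars(body):
        raise ValueError("substitution would bind the inserted variable")
    return ("exists", v, substitute(body, var, new))

def substitute_via_variant(f, var, new):
    """Like `substitute`, but first pass to an alphabetic variant if needed."""
    if f[0] == "in":
        return substitute(f, var, new)
    _, v, body = f
    if v == var:
        return f
    if v == new and var in free_vars(body):
        taken = all_vars(f) | {var, new}
        fresh = next(n for n in (f"z{i}" for i in count()) if n not in taken)
        v, body = fresh, substitute(body, v, fresh)   # relabel the bound variable
    return ("exists", v, substitute_via_variant(body, var, new))

condition = ("exists", "y", ("in", "y", "x"))          # (∃y)(y ϵ x)
variant_result = substitute_via_variant(condition, "x", "y")
```

Here `substitute(condition, "x", "y")` raises, matching the remark that (1) cannot be paraphrased as it stands, while `variant_result` is the encoding of ‘(∃z0)(z0 ϵ y)’, the counterpart of the paraphrase obtained from the alphabetic variant.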


§32. Further Consequences 

THIS SECTION embraces a variety of theorems and metatheo¬ 
rems on membership, identity, and abstraction. The first of these 
theorems was recognized earlier (§ 23) as the shortest of all logical 
truths (from the standpoint of primitive notation). 

†240. $\sim(x)(x \epsilon x)$
Proof.
†192   $\Lambda \,\tilde{\epsilon}\, \Lambda$   (1)
*232   $[1 \supset]\ (\exists x)(x \,\tilde{\epsilon}\, x)$   (2)
*130   $240\ [\equiv 2]$

†241. $(\exists x)(x \epsilon x)$
Proof. *232   $[210 \supset]\ 241$

The next theorem shows that an element $x$ is identical with $y$ if and only if

(1)   $(z)(x \epsilon z .\supset.\ y \epsilon z)$.

That (1) follows from ‘$x = y$’ is clear, indeed, regardless of whether $x$ is an element (cf. *201); on the other hand the inference of ‘$x = y$’ from (1) does depend on elementhood of $x$, for if $x$ is not a member of any class then (1) is trivially true independently of $y$.

†242. $(y)(x) \quad x \epsilon V .\supset:\ x = y \,.\equiv (z)(x \epsilon z .\supset.\ y \epsilon z)$

Proof. 

*201, *117   $x = y \,.\supset (z)(x \epsilon z .\supset.\ y \epsilon z)$   (1)
*231   $(z)(x \epsilon z .\supset.\ y \epsilon z) \supset:\ x \epsilon \hat{w}(x = w) .\supset.\ y \epsilon \hat{w}(x = w)$
*235, *123   $\supset:\ x \epsilon V\ [.\ 182]\ .\supset.\ y \epsilon V \,.\, x = y$   (2)

*100 £2 x e V .3 : (z)(* e z .3. yez)D.yeV.x = y 

*100 3[43 :[]# = y .= (z)(,v e z .3. y c z) 

Note that the third line of the above proof is a compressed rendering of two duplicate steps, as follows:

*235, *123   $\supset:\ x \epsilon V\ [.\ 182]\ .\supset.\ y \epsilon \hat{w}(x = w)$
*235, *123   $\supset:\ x \epsilon V .\supset.\ y \epsilon V \,.\, x = y$   (2)

Such telescoping of duplicate steps, in connection with *123 and likewise *224, will be usual hereafter. It occurs again in the second line of the proof of *244, below.

When the ‘$x \epsilon z .\supset.\ y \epsilon z$’ of †242 is strengthened to ‘$x \epsilon z .\equiv.\ y \epsilon z$’, we still have a theorem:

†243. $(y)(x) \quad x \epsilon V .\supset:\ x = y \,.\equiv (z)(x \epsilon z .\equiv.\ y \epsilon z)$

Proof similar, using *223 instead of *201.

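The following is the main principle regarding equation of abstracts.

*244. $\vdash \ulcorner\hat{\alpha}\phi = \hat{\alpha}\psi \,.\equiv (\alpha)(\alpha \epsilon V .\supset.\ \phi \equiv \psi)\urcorner$.

Proof.
*171 (& D10)   $\vdash \ulcorner\hat{\alpha}\phi = \hat{\alpha}\psi \,.\equiv (\alpha)(\alpha \epsilon \hat{\alpha}\phi .\equiv.\ \alpha \epsilon \hat{\alpha}\psi)$
*235, *123   $\equiv (\alpha)(\alpha \epsilon V \,.\, \phi \,.\equiv.\ \alpha \epsilon V \,.\, \psi)$
*100, *123   $\equiv (\alpha)(\alpha \epsilon V .\supset.\ \phi \equiv \psi)\urcorner$.

Various corollaries follow.

*245. If $\chi$ is formed from $\psi$ by putting $\phi$ for some free occurrences of $\ulcorner\alpha \epsilon \hat{\alpha}\phi\urcorner$, then $\vdash \ulcorner\hat{\alpha}\psi = \hat{\alpha}\chi\urcorner$.

Proof.
*100   $\vdash \ulcorner[230 \supset:]\ \alpha \epsilon V .\supset:\ \alpha \epsilon \hat{\alpha}\phi \,.\equiv \phi$
*122 (& hp)   $\supset.\ \psi \equiv \chi\urcorner$.   (1)
*244   $\vdash \ulcorner 245\ [\equiv (\alpha)1]\urcorner$.

*246. $\vdash \ulcorner\hat{\alpha}\phi = \hat{\alpha}(\alpha \epsilon V \,.\, \phi)\urcorner$.

Proof.
*100   $\vdash \ulcorner\alpha \epsilon V .\supset:\ \phi \,.\equiv.\ \alpha \epsilon V \,.\, \phi\urcorner$.   (1)
*244   $\vdash \ulcorner 246\ [\equiv (\alpha)1]\urcorner$.

*247. $\vdash \ulcorner\hat{\alpha}\phi = \hat{\alpha}(\alpha \epsilon V .\supset \phi)\urcorner$. Proof similar.

*248. $\vdash \ulcorner\hat{\alpha}\phi = \hat{\alpha}(\alpha \epsilon V .\equiv \phi)\urcorner$. Proof similar.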
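*249. $\vdash \ulcorner\hat{\alpha}\phi = V \,.\equiv (\alpha)(\alpha \epsilon V .\supset \phi)\urcorner$.

Proof.
†189, *224   $\vdash \ulcorner\hat{\alpha}\phi = V \,.\equiv.\ \hat{\alpha}\phi = \hat{\alpha}(\alpha \epsilon V)$
*244   $\equiv (\alpha)(\alpha \epsilon V .\supset:\ \phi \,.\equiv.\ \alpha \epsilon V)$
*100, *123   $\equiv (\alpha)(\alpha \epsilon V .\supset \phi)\urcorner$.

The first line of the above proof involves construing †189 as $\ulcorner V = \hat{\alpha}(\alpha \epsilon V)\urcorner$. This deviation from the original form of †189 turns not only upon use of *233, such as is regularly left tacit in connection with daggered references (cf. § 31), but also upon tacit use of *171 in case the bound variable $\alpha$ happens to be other than ‘$y$’. *233 leads from †189 to ‘$V = \hat{y}(y \epsilon V)$’, and *171 leads thence to $\ulcorner V = \hat{\alpha}(\alpha \epsilon V)\urcorner$. The second line of the following proof is similar. Hereafter, thus, *171 will fare like *233; its use will be left tacit in connection with daggered references.

*250. $\vdash \ulcorner\hat{\alpha}\phi = \Lambda \,.\equiv (\alpha)(\alpha \epsilon V .\supset \sim\phi)\urcorner$.

Proof.
†192   $\vdash \ulcorner\alpha \,\tilde{\epsilon}\, \Lambda\urcorner$.   (1)
†189, *224   $\vdash \ulcorner\hat{\alpha}\phi = \Lambda \,.\equiv.\ \hat{\alpha}\phi = \hat{\alpha}(\alpha \epsilon \Lambda)$
*244   $\equiv (\alpha)(\alpha \epsilon V .\supset:\ \phi \,.\equiv.\ \alpha \epsilon \Lambda)$
*100, *123   $\equiv (\alpha)(\alpha \epsilon V .\supset.\ \sim\phi\ [\equiv 1])\urcorner$.

From *249 and *250 it is clear that the identity $\ulcorner\hat{\alpha}\phi = V\urcorner$ is not in general equivalent simply to $\ulcorner(\alpha)\phi\urcorner$, nor $\ulcorner\hat{\alpha}\phi = \Lambda\urcorner$ to $\ulcorner(\alpha)\sim\phi\urcorner$. On the contrary, $\hat{x}(\ldots x \ldots)$ may be $V$ even though ‘$\ldots x \ldots$’ fails for various non-elements $x$, and $\hat{x}(\ldots x \ldots)$ may be $\Lambda$ even though ‘$\ldots x \ldots$’ holds for various non-elements $x$. The ‘$\supset$’ of *193 and *194 cannot in general be strengthened to ‘$\equiv$’. But ‘$y = \Lambda$’ and ‘$(x)(x \,\tilde{\epsilon}\, y)$’ do prove equivalent, as seen in †192a.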

In the theory of quantification it was found convenient not to require, for significance of $\ulcorner(\alpha)\phi\urcorner$ and $\ulcorner(\exists\alpha)\phi\urcorner$, that $\alpha$ be free in $\phi$. In the vacuous case where $\alpha$ is not free in $\phi$, such quantifications come out equivalent simply to $\phi$. (Cf. pp. 74, 104.) Now it is likewise convenient not to require that $\alpha$ be free in $\phi$ for significance of $\ulcorner\hat{\alpha}\phi\urcorner$; the definitions of class abstraction already adopted, indeed, impose no such restriction. The significance of abstraction in the vacuous case where $\alpha$ is not free in $\phi$ is explained in the following two metatheorems, which are corollaries of *249 and *250.

*253. If $\alpha$ is not free in $\phi$, $\vdash \ulcorner\hat{\alpha}\phi = V \,.\equiv \phi\urcorner$.

Proof.
*232   $\vdash \ulcorner[210 \supset]\ (\exists\alpha)(\alpha \epsilon V)\urcorner$.   (1)
*249   $\vdash \ulcorner\hat{\alpha}\phi = V \,.\equiv (\alpha)(\alpha \epsilon V .\supset \phi)$
*161 (& hp)   $\equiv.\ [1 \supset]\ \phi\urcorner$.

*254. If $\alpha$ is not free in $\phi$, $\vdash \ulcorner\hat{\alpha}\phi = \Lambda \,.\equiv \sim\phi\urcorner$.

Proof similar, using *250 instead of *249.

Thus a vacuous abstract is capable of designating only $V$ or $\Lambda$. It designates $V$ if $\phi$ is true and $\Lambda$ if $\phi$ is false.

We might arbitrarily interpret the two truth values as $V$ and $\Lambda$, thus speaking of the truth value of true statements as $V$ and the truth value of falsehoods as $\Lambda$. Then the abstraction prefix $\ulcorner\hat{\alpha}\urcorner$, when vacuous, comes to amount to a truth-value operator; $\ulcorner\hat{\alpha}\phi\urcorner$ comes to designate the truth value of $\phi$.
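The behaviour of vacuous abstraction can be pictured in a small finite model. The sketch below is only an illustration under loud assumptions: the stock of elements `ELEMENTS` is an invented three-member set, classes are modelled as Python frozensets of elements, and the names `abstract` and `vacuous_abstract` are hypothetical. Non-elements, which matter elsewhere in the text, are not represented at all.

```python
# Assumed finite stock of elements; V is then the class of all of them.
ELEMENTS = frozenset({"a", "b", "c"})
V = ELEMENTS
LAMBDA = frozenset()

def abstract(condition):
    """Class of all elements fulfilling `condition` (a Python predicate)."""
    return frozenset(e for e in ELEMENTS if condition(e))

def vacuous_abstract(truth_value):
    """Abstraction whose condition ignores the bound variable entirely."""
    return abstract(lambda _element: truth_value)

class_for_truth = vacuous_abstract(True)
class_for_falsehood = vacuous_abstract(False)
```

In this model `class_for_truth` is `V` and `class_for_falsehood` is `LAMBDA`, in line with *253 and *254; the prefix behaves as a truth-value operator exactly as described.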

From the curious but trivial topic of vacuous abstraction we 
turn back now, for a short space, to the more serious topic of 
elementhood. 

It has been remarked (§ 28) that *200 provides merely a suffi¬ 
cient condition for elementhood, rather than a necessary one. 
Other sufficient conditions appear in the following metatheorems. 

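*256. $\vdash \ulcorner(\alpha)\phi \supset.\ \hat{\alpha}\phi \epsilon V\urcorner$.

Proof.
*193   $\vdash \ulcorner(\alpha)\phi \supset.\ \hat{\alpha}\phi = V$
*223   $\supset:\ \hat{\alpha}\phi \epsilon V\ [.\equiv 210]\urcorner$.

*257. $\vdash \ulcorner(\alpha)\sim\phi \supset.\ \hat{\alpha}\phi \epsilon V\urcorner$.

Proof similar, using *194 and †211.

*258. If $\alpha$ is not free in $\phi$, $\vdash \ulcorner\hat{\alpha}\phi \epsilon V\urcorner$.

Proof.
*118 (& hp), *123   $\vdash \ulcorner\phi \supset.\ \hat{\alpha}\phi \epsilon V\ [:\equiv 256]\urcorner$.   (1)
*118 (& hp), *123   $\vdash \ulcorner\sim\phi \supset.\ \hat{\alpha}\phi \epsilon V\ [:\equiv 257]\urcorner$.   (2)
*100   $\vdash \ulcorner[1 \,.\, 2 \,.\supset]\ 258\urcorner$.

*259. $\vdash \ulcorner\hat{\alpha}\phi \epsilon V \,.\equiv.\ \hat{\alpha}\sim\phi \epsilon V\urcorner$.

Proof. Let $\beta$ be new. Since $\ulcorner\alpha \,\tilde{\epsilon}\, \beta\urcorner$ is stratified,

*200   $\vdash \ulcorner\beta \epsilon V .\supset.\ \hat{\alpha}(\alpha \,\tilde{\epsilon}\, \beta) \epsilon V\urcorner$.   (1)
*231   $\vdash \ulcorner[(\beta)1 \supset:]\ \hat{\alpha}\phi \epsilon V .\supset.\ \hat{\alpha}(\alpha \,\tilde{\epsilon}\, \hat{\alpha}\phi) \epsilon V$
*245, *224   $\supset.\ \hat{\alpha}\sim\phi \epsilon V\urcorner$.   (2)
Similarly   $\vdash \ulcorner\hat{\alpha}\sim\phi \epsilon V .\supset.\ \hat{\alpha}\sim\sim\phi \epsilon V$
*100, *123   $\supset.\ \hat{\alpha}\phi \epsilon V\urcorner$.   (3)
*100   $\vdash \ulcorner[2 \,.\, 3 \,.\supset]\ 259\urcorner$.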

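It has already been noted (§ 24) that the classes

$\hat{x}(x \,\tilde{\epsilon}\, x)$, $\quad \hat{x}(y)\sim(x \epsilon y \,.\, y \epsilon x)$, $\quad \hat{x}(y)(z)\sim(x \epsilon y \,.\, y \epsilon z \,.\, z \epsilon x)$,

etc. are not elements. By virtue of *259, the same can be proved also of the classes

$\hat{x}(x \epsilon x)$, $\quad \hat{x}(\exists y)(x \epsilon y \,.\, y \epsilon x)$, $\quad \hat{x}(\exists y)(\exists z)(x \epsilon y \,.\, y \epsilon z \,.\, z \epsilon x)$,

etc. All these matters are recorded in the following theorems and metatheorems.

†260. $\hat{x}(x \,\tilde{\epsilon}\, x) \,\tilde{\epsilon}\, V$

Proof.
*235   $\hat{x}(x \,\tilde{\epsilon}\, x) \epsilon \hat{x}(x \,\tilde{\epsilon}\, x) \,.\equiv.\ \hat{x}(x \,\tilde{\epsilon}\, x) \epsilon V \,.\, \hat{x}(x \,\tilde{\epsilon}\, x) \,\tilde{\epsilon}\, \hat{x}(x \,\tilde{\epsilon}\, x)$   (1)
*100   $[1 \supset]\ 260$

†261. $\hat{x}(x \epsilon x) \,\tilde{\epsilon}\, V$

Proof. *259, *123   $261\ [\equiv 260]$

*262. If $\alpha, \gamma_1, \ldots, \gamma_n$ are distinct,

$\vdash \ulcorner\hat{\alpha}(\gamma_1) \ldots (\gamma_n)\sim(\alpha \epsilon \gamma_1 \,.\, \gamma_1 \epsilon \gamma_2 \,.\, \ldots \,.\, \gamma_n \epsilon \alpha) \,\tilde{\epsilon}\, V\urcorner$.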

Proof. Let 13 be new. 

*230 \~ r oct a( 71 ) . . . ( 7 „)~(a € 71 . 7 »*a). = , 

a e V . ( 71 ) . . . (7n)~ (a « 7i. 7 -« a)L (1) 

*180 b r (a)(<* « d ■— Rl) 3. /S e V. (2) 

*231 KLO)2 3. (a)l 3] 262L 

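*263. If $\alpha, \gamma_1, \ldots, \gamma_n$ are distinct,

$\vdash \ulcorner\hat{\alpha}(\exists\gamma_1) \ldots (\exists\gamma_n)(\alpha \epsilon \gamma_1 \,.\, \gamma_1 \epsilon \gamma_2 \,.\, \ldots \,.\, \gamma_n \epsilon \alpha) \,\tilde{\epsilon}\, V\urcorner$.

Proof.
*133, *123   $\vdash \ulcorner\hat{\alpha}\sim(\exists\gamma_1) \ldots (\exists\gamma_n)(\alpha \epsilon \gamma_1 \,.\, \ldots \,.\, \gamma_n \epsilon \alpha) \,\tilde{\epsilon}\, V\ [.\equiv 262]\urcorner$.   (1)
*259, *123   $\vdash \ulcorner 263\ [\equiv 1]\urcorner$.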

§33. Logical Product, Sum, Complement 


THE CLASS $\hat{z}(z \epsilon x \,.\, z \epsilon y)$, which has as members the common members of $x$ and $y$, is called the logical product of $x$ and $y$ and designated by the abbreviated symbolism ‘$x \cap y$’.

D18. $\ulcorner(\zeta \cap \eta)\urcorner$ for $\ulcorner\hat{\alpha}(\alpha \epsilon \zeta \,.\, \alpha \epsilon \eta)\urcorner$.

The choice of $\alpha$ here is subject to the usual understanding: any variable foreign to $\zeta$ and $\eta$.

The connective ‘$\cap$’ is of a new kind. Whereas ‘$\downarrow$’, ‘$\vee$’, ‘$\supset$’, and ‘$\equiv$’ join formulae to make formulae, and ‘$\epsilon$’ and ‘$=$’ join terms to make formulae, ‘$\cap$’ joins terms to make terms. Like all binary connectives, ‘$\cap$’ is to be construed as carrying with it a pair of parentheses; and D18 has been fashioned accordingly. In practice, however, the parentheses will be dropped when there is no danger of confusion.

The class $\hat{z}(z \epsilon x .\vee.\ z \epsilon y)$, which has as members all the members of $x$ together with all the members of $y$, is called the logical sum of $x$ and $y$. The class $\hat{z}(z \,\tilde{\epsilon}\, x)$, which has as members all elements except the members of $x$, is called the complement or negate of $x$. Symbolically the sum and complement are rendered ‘$x \cup y$’ and ‘$\overline{x}$’.

D19. $\ulcorner(\zeta \cup \eta)\urcorner$ for $\ulcorner\hat{\alpha}(\alpha \epsilon \zeta .\vee.\ \alpha \epsilon \eta)\urcorner$.

D20. $\ulcorner\overline{\zeta}\urcorner$ for $\ulcorner\hat{\alpha}(\alpha \,\tilde{\epsilon}\, \zeta)\urcorner$.

Thus, where $x$ is the class of Americans and $y$ the class of politicians, $x \cap y$ is the class of American politicians; $x \cup y$ is the class which comprises all Americans (politicians and otherwise) together with all foreign politicians; and $\overline{x}$ is the class of un-American elements.
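The example of Americans and politicians can be tried out on a toy model. This is a minimal sketch, assuming an invented four-person stock of elements standing in for V; the individuals and the variable names are hypothetical, and Python's built-in frozenset operators are used in place of D18-D20.

```python
# Assumed miniature universe of elements (playing the role of V).
everyone = frozenset({"ann", "bob", "chloe", "dmitri"})

americans = frozenset({"ann", "bob"})        # the class x
politicians = frozenset({"bob", "chloe"})    # the class y

product = americans & politicians      # x ∩ y: American politicians
logical_sum = americans | politicians  # x ∪ y: Americans plus foreign politicians
complement = everyone - americans      # complement of x: un-American elements
```

Note that the complement is taken relative to the stock of elements, in keeping with D20, which collects only elements that are not members of x.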

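†270. $(z)(y)(x) \quad z \epsilon x \cap y \,.\equiv.\ z \epsilon x \,.\, z \epsilon y$

Proof.
*230 (& D18)   $z \epsilon x \cap y \,.\equiv.\ z \epsilon V \,.\, z \epsilon x \,.\, z \epsilon y$
†191, *123   $\equiv.\ z \epsilon x \,.\, z \epsilon y$

†271. $(z)(y)(x) \quad z \epsilon x \cup y \,.\equiv:\ z \epsilon x .\vee.\ z \epsilon y$

Proof.
*230 (& D19)   $z \epsilon x \cup y \,.\equiv:\ z \epsilon V :\ z \epsilon x .\vee.\ z \epsilon y$
*100   $\equiv:\ z \epsilon V \,.\, z \epsilon x .\vee.\ z \epsilon V \,.\, z \epsilon y$
†191, *123   $\equiv:\ z \epsilon x .\vee.\ z \epsilon y$

The analogue of †270 and †271 for negation, viz.:

$(z)(x) \quad z \epsilon \overline{x} \,.\equiv.\ z \,\tilde{\epsilon}\, x$

is not forthcoming; on the contrary, $z$ is a member neither of $x$ nor of $\overline{x}$ (nor of anything else) in case $z$ is a non-element. It is to ‘$z \epsilon V \,.\, z \,\tilde{\epsilon}\, x$’, and not just ‘$z \,\tilde{\epsilon}\, x$’, that ‘$z \epsilon \overline{x}$’ is equivalent (cf. *230, D20).

In the proof of †270 the conjunction ‘$z \epsilon V \,.\, z \epsilon x \,.\, z \epsilon y$’ is construed first as ‘$z \epsilon V :\ z \epsilon x \,.\, z \epsilon y$’ and then as ‘$z \epsilon V \,.\, z \epsilon x :\ z \epsilon y$’. A tacit transformation according to *100 is thus involved. It is natural to leave such reassociative transformations tacit in the case of conjunction, thus treating $\ulcorner\phi \,.\, \psi \,.\, \chi\urcorner$ in effect as $\ulcorner\phi \,.\, \psi : \chi\urcorner$ or $\ulcorner\phi : \psi \,.\, \chi\urcorner$ at convenience; and similarly for alternation.

Products, sums, and complements of elements prove to be elements:

†272. $(y)(x) \quad x, y \epsilon V .\supset.\ x \cap y \epsilon V$

Proof: *200 (& D18), since ‘$z \epsilon x \,.\, z \epsilon y$’ is stratified (witness ‘$0 \epsilon 1 \,.\, 0 \epsilon 1$’).

†273. $(y)(x) \quad x, y \epsilon V .\supset.\ x \cup y \epsilon V$   Proof similar.

†274. $(x) \quad x \epsilon V \,.\equiv.\ \overline{x} \epsilon V$

Proof.
†189, *224   $x \epsilon V \,.\equiv.\ \hat{y}(y \epsilon x) \epsilon V$
*259 (& D20)   $\equiv.\ \overline{x} \epsilon V$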

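Corresponding to all tautologous biconditionals involving ‘$\sim$’, ‘$.$’, and ‘$\vee$’, it is easy with help of *245 to derive theorems of identity involving ‘$\overline{\phantom{x}}$’, ‘$\cap$’, and ‘$\cup$’. The first seven of the following theorems, e.g., correspond to the tautologous forms (2)-(4), (7), (8), (10), and (11) of § 11.

†275. $(x) \quad x = \overline{\overline{x}}$
†276. $(x) \quad x = x \cap x$
†277. $(x) \quad x = x \cup x$
†278. $(y)(x) \quad x \cap y = y \cap x$
†279. $(y)(x) \quad x \cup y = y \cup x$
†280. $(y)(x) \quad x \cap y = \overline{\overline{x} \cup \overline{y}}$
†281. $(y)(x) \quad x \cup y = \overline{\overline{x} \cap \overline{y}}$
†282. $(y)(x) \quad \overline{x \cap y} = \overline{x} \cup \overline{y}$
†283. $(y)(x) \quad \overline{x \cup y} = \overline{x} \cap \overline{y}$
†284. $(y)(x) \quad x \cap y = (x \cup \overline{y}) \cap y$
†285. $(y)(x) \quad x \cup y = (x \cap \overline{y}) \cup y$
†286. $(z)(y)(x) \quad (x \cap y) \cap z = x \cap (y \cap z)$
†287. $(z)(y)(x) \quad (x \cup y) \cup z = x \cup (y \cup z)$
†288. $(z)(y)(x) \quad x \cap (y \cup z) = (x \cap y) \cup (x \cap z)$
†289. $(z)(y)(x) \quad x \cup (y \cap z) = (x \cup y) \cap (x \cup z)$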


The proofs of such identities follow a fixed pattern. It will suffice to exhibit two samples.

†189   $x = \hat{y}(y \epsilon x)$
*100, *188   $= \hat{y}\sim\sim(y \epsilon x)$
*245 (& D20)   $= \overline{\overline{x}}$


*245 (& D19, 20) 

x w y = z~(z tx.v.z e 

y) 

*245 (& D20) 

= z~(z et.v.u 

y) 

*100, *188 

— z(z t x . z i y) 


*245 (& D20, 18) 

= X rs y 



The stacking of identity signs, in the above proofs, is subject to conventions exactly parallel to those which govern the stacking of conditional and biconditional signs (§ 17). Just as the inferences involved in the latter two procedures are justified respectively by *112 and *113, so the inference involved in the stacking of identity signs is justified by the principle:

If $\vdash \ulcorner\zeta_1 = \zeta_2\urcorner$, $\vdash \ulcorner\zeta_2 = \zeta_3\urcorner$, ..., $\vdash \ulcorner\zeta_{n-1} = \zeta_n\urcorner$, then $\vdash \ulcorner\zeta_1 = \zeta_n\urcorner$.

Proof.
hp 1   $\vdash \ulcorner\zeta_1 = \zeta_2\urcorner$.   (1)
hp 2, *224   $\vdash \ulcorner[1 \equiv.]\ \zeta_1 = \zeta_3\urcorner$.   (2)
hp 3, *224   $\vdash \ulcorner[2 \equiv.]\ \zeta_1 = \zeta_4\urcorner$, and so on.

In metatheorems or theorems cited by starred or daggered numerals in the course of proofs, the distinction between $\ulcorner\phi \,.\, \psi\urcorner$ and $\ulcorner\psi \,.\, \phi\urcorner$ will hereafter be disregarded; and likewise the distinctions between $\ulcorner\phi \vee \psi\urcorner$ and $\ulcorner\psi \vee \phi\urcorner$, between $\ulcorner\phi \equiv \psi\urcorner$ and $\ulcorner\psi \equiv \phi\urcorner$, between $\ulcorner\zeta = \eta\urcorner$ and $\ulcorner\eta = \zeta\urcorner$, between $\ulcorner\zeta \cap \eta\urcorner$ and $\ulcorner\eta \cap \zeta\urcorner$, and between $\ulcorner\zeta \cup \eta\urcorner$ and $\ulcorner\eta \cup \zeta\urcorner$. This procedure involves tacit transformations according to *100 (and *123) in the case of ‘$.$’, ‘$\vee$’, and ‘$\equiv$’; according to †183 (and *123) in the case of ‘$=$’; and according to †278 or †279 (and *224) in the case of ‘$\cap$’ or ‘$\cup$’. Instances of this practice appear in the last lines of the above proofs of †275 and †283, where the $\ulcorner\hat{\alpha}\psi = \hat{\alpha}\chi\urcorner$ of *245 is treated as $\ulcorner\hat{\alpha}\chi = \hat{\alpha}\psi\urcorner$. This innovation is to apply only to citations by starred or daggered numeral; divorced of its star or dagger and used as an abbreviation, the numeral continues to stand for just the original expression without modification.

†275 is the law of double negation; †280-†283 embody the class version of De Morgan’s law; and †276-†279 and †286-†289, by a natural extension of the terminology applied earlier (§ 11) to statement composition, are said to affirm the idempotence, commutativity, associativity, and mutual distributivity of logical multiplication and addition.¹
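The identities †275-†289 can be spot-checked by brute force over a small universe. The following is a sketch only, assuming a three-element stock of elements `UNIVERSE` standing in for V and modelling classes as its subsets; the helper names `classes`, `comp`, and `laws_hold` are hypothetical. A finite check of this kind illustrates the laws and is no substitute for the proofs by *245.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1, 2})   # assumed finite stock of elements (the role of V)

def classes(universe):
    """All subclasses of the universe, i.e. its power set."""
    items = sorted(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def comp(x):
    """Complement relative to the universe, as in D20."""
    return UNIVERSE - x

def laws_hold():
    """Check double negation, idempotence, commutativity, De Morgan,
    associativity, and distributivity for every choice of x, y, z."""
    cs = classes(UNIVERSE)
    for x in cs:
        checks = [comp(comp(x)) == x, x & x == x, x | x == x]
        for y in cs:
            checks += [
                x & y == y & x,
                x | y == y | x,
                x & y == comp(comp(x) | comp(y)),
                x | y == comp(comp(x) & comp(y)),
                comp(x & y) == comp(x) | comp(y),
                comp(x | y) == comp(x) & comp(y),
                x & y == (x | comp(y)) & y,
                x | y == (x & comp(y)) | y,
            ]
            for z in cs:
                checks += [
                    (x & y) & z == x & (y & z),
                    (x | y) | z == x | (y | z),
                    x & (y | z) == (x & y) | (x & z),
                    x | (y & z) == (x | y) & (x | z),
                ]
        if not all(checks):
            return False
    return True

all_laws_verified = laws_hold()
```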

The next two theorems, which have the form of biconditionals rather than identities, are analogues of the tautologous forms (17) and (18) of § 11; analogues in a different way, however, from the way in which †275-†289 are analogues of (2)-(4), (7), etc.

†290. $(y)(x) \quad x = y \,.\equiv.\ \overline{x} = \overline{y}$

Proof.
†191, *123 (& D10)   $x = y \,.\equiv (z)(z \epsilon V \,.\, z \epsilon x \,.\equiv.\ z \epsilon V \,.\, z \epsilon y)$
*100, *123   $\equiv (z)(z \epsilon V .\supset:\ z \,\tilde{\epsilon}\, x \,.\equiv.\ z \,\tilde{\epsilon}\, y)$
*244 (& D20)   $\equiv.\ \overline{x} = \overline{y}$

†291. $(y)(x) \quad x = \overline{y} \,.\equiv.\ \overline{x} = y$

Proof.
†290   $x = \overline{y} \,.\equiv.\ \overline{x} = \overline{\overline{y}}$
†275, *224   $\equiv.\ \overline{x} = y$

Unlike the identities, however, not all the statements which are analogous to tautologous forms in this broader fashion are true. The analogue:

$(y)(x) \quad x \neq y \,.\equiv.\ x = \overline{y}$

of the tautologous form (13), e.g., is false. Its truth would require, for every choice of $y$, that the only entity other than $y$ be $\overline{y}$; whereas we can easily prove e.g. that $\hat{z}(\Lambda \epsilon z)$ is neither $\Lambda$ nor $\overline{\Lambda}$.
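The failure of this analogue shows up already in a very small model. The sketch below assumes a two-element stock of elements `UNIVERSE` (an invention for illustration) with classes modelled as its subsets; it does not reproduce the counterexample cited in the text, which involves a class of classes. The names `classes` and `counterexamples` are hypothetical.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1})      # assumed two-element stock of elements

def classes():
    """All subclasses of the assumed universe."""
    items = sorted(UNIVERSE)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def counterexamples():
    """Pairs (x, y) where 'x differs from y' and 'x is the complement of y'
    receive different truth values."""
    cs = classes()
    return [(x, y) for x in cs for y in cs
            if (x != y) != (x == UNIVERSE - y)]

failures = counterexamples()
```

For instance the pair consisting of the class {0} and the null class appears among `failures`: the two differ, yet {0} is not the complement of the null class.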

The next eight theorems bring logical addition, multiplication, and negation into relation with $V$ and $\Lambda$.

†292. $\overline{\Lambda} = V$

Proof. *193 (& D20)   $[(x)192 \supset]\ 292$

1 Historically it is rather the application to statement composition that con¬ 
stitutes an extension. The terminology first developed in application to term con¬ 
nectives as in the present section. 
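†293. $\overline{V} = \Lambda$

Proof. †291   $[292 \equiv]\ 293$

†294. $(x) \quad x \cap \overline{x} = \Lambda$

Proof.
*230 (& D20)   $y \epsilon \overline{x} \,.\equiv.\ y \epsilon V \,.\, y \,\tilde{\epsilon}\, x$   (1)
*100   $[1 \supset] \sim(y \epsilon x \,.\, y \epsilon \overline{x})$   (2)
*194 (& D18)   $[(y)2 \supset]\ 294$

†295. $(x) \quad x \cup \overline{x} = V$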

Proof. f282 x w x = x r\ x 
|294, *227 = A 

t292 = V 

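†296. $(x) \quad x \cap V = x$

Proof.
*246 (& D18)   $x \cap V = \hat{y}(y \epsilon x)$
†189   $= x$

†297. $(x) \quad x \cup \Lambda = x$

Proof.
†192   $y \,\tilde{\epsilon}\, \Lambda$   (1)
*100, *123   $[189 \equiv.]\ x = \hat{y}(y \epsilon x .\vee.\ y \epsilon \Lambda\ [.\ 1])$
(q.e.d.; cf. D19)

†298. $(x) \quad x \cap \Lambda = \Lambda$

Proof.
†192   $y \,\tilde{\epsilon}\, \Lambda$   (1)
*100   $[1 \supset] \sim(y \epsilon x \,.\, y \epsilon \Lambda)$   (2)
*194 (& D18)   $[(y)2 \supset]\ 298$

†299. $(x) \quad x \cup V = V$

Proof.
*100   $y \epsilon V .\supset:\ y \epsilon x .\vee.\ y \epsilon V$   (1)
*249 (& D19)   $299\ [\equiv (y)1]$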

The notions of logical sum, product, and complement and the theorems †275-†299 belong to the old “algebra of logic” mentioned in § 1. The above method of
deriving this class algebra from the theory of statement composition is essentially 
Peano’s. When inclusion and abstraction are taken as primitive (cf. §23), on the 
other hand, the theory of statement composition becomes simply a case of the 
algebra of classes; cf. my System, Ch. XII. For postulational analyses of this 
algebra as a segregated system see particularly Huntington (“Sets,” “New Sets”) 
and Stone. 





§ 34. Inclusion 

IF ALL members of $x$ are members of $y$, then $x$ is said to be included in $y$; symbolically, $x \subset y$. The definition is thus:

D21. $\ulcorner(\zeta \subset \eta)\urcorner$ for $\ulcorner(\alpha)(\alpha \epsilon \zeta .\supset.\ \alpha \epsilon \eta)\urcorner$,

where $\alpha$ is foreign to $\zeta$ and $\eta$. The sign ‘$\subset$’ answers to the word ‘are’, in such contexts as ‘Cats are animals’ (i.e., ‘The class of cats is included in the class of animals’).

Inclusion must be distinguished carefully from membership. Where $x \subset y$ it may or may not be the case that $x \epsilon y$, and where $x \epsilon y$ it may or may not be the case that $x \subset y$. The class of cats, e.g., is included in the class of animals but is not a member of it; i.e., each cat is an animal but the class of cats is not. On the other hand the class of Mormons is a member of the class of Christian sects but is not included in it; it is true that the class of Mormons is a sect but false that each Mormon is a sect. Cases do arise, however, where $x \subset y$ and at the same time $x \epsilon y$; e.g., any element $x$ is both included in $V$ (cf. †333) and a member of $V$.
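The contrast between inclusion and membership can be made concrete with nested frozensets. This is a minimal sketch under assumed data: the individuals named below are invented, and Python's `<=` and `in` are used as stand-ins for inclusion and membership respectively.

```python
# Inclusion without membership: cats and animals (invented individuals).
cats = frozenset({"tom", "felix"})
animals = frozenset({"tom", "felix", "rex"})
cats_included = cats <= animals          # every cat is an animal
cats_member = cats in animals            # but the class of cats is not an animal

# Membership without inclusion: a sect among sects (invented members).
mormons = frozenset({"member_1", "member_2"})
other_sect = frozenset({"member_3"})
sects = frozenset({mormons, other_sect})
mormons_member = mormons in sects        # the class is one of the sects
mormons_included = mormons <= sects      # but its members are not sects

# Both at once: the null class and its unit class.
null_class = frozenset()
unit_of_null = frozenset({null_class})
both = (null_class <= unit_of_null) and (null_class in unit_of_null)
```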

Like identity, inclusion is reflexive and transitive.

†310. $(x) \quad x \subset x$   Proof: *100 (& D21).

†311. $(z)(y)(x) \quad x \subset y \,.\, y \subset z \,.\supset.\ x \subset z$

Proof.
*100   $w \epsilon x .\supset.\ w \epsilon y :\ w \epsilon y .\supset.\ w \epsilon z :\supset:\ w \epsilon x .\supset.\ w \epsilon z$   (1)
*101 (& D21)   $[(w)1 \supset:]\ (w)(w \epsilon x .\supset.\ w \epsilon y :\ w \epsilon y .\supset.\ w \epsilon z) \supset.\ x \subset z$   (2)
*140, *123 (& D21)   $[2 \equiv]\ 311$

But inclusion is not, like identity, commutative (symmetrical). Far from being equivalent, ‘$x \subset y$’ and ‘$y \subset x$’ are incompatible except in the reflexive case where $y$ is $x$. This appears in the following theorem, according to which mutual inclusion is identity.

†312. $(y)(x) \quad x = y \,.\equiv.\ x \subset y \,.\, y \subset x$   Proof: *140 (& D10, 5, 21)

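The following metatheorems relate inclusion to quantification and abstraction:

*313. If $\alpha$ is not free in $\zeta$, $\vdash \ulcorner(\alpha)(\alpha \epsilon \zeta .\supset \phi) \equiv.\ \zeta \subset \hat{\alpha}\phi\urcorner$.

Proof.
†190   $\vdash \ulcorner\alpha \epsilon \zeta .\supset.\ \alpha \epsilon V\urcorner$.   (1)
*100, *123   $\vdash \ulcorner(\alpha)(\alpha \epsilon \zeta .\supset \phi\ [:\ 1]) \equiv (\alpha)(\alpha \epsilon \zeta .\supset.\ \alpha \epsilon V \,.\, \phi)$
*230, *123   $\equiv (\alpha)(\alpha \epsilon \zeta .\supset.\ \alpha \epsilon \hat{\alpha}\phi)$
*171 (& hp & D21)   $\equiv.\ \zeta \subset \hat{\alpha}\phi\urcorner$.

*314. $\vdash \ulcorner(\alpha)(\phi \supset \psi) \supset.\ \hat{\alpha}\phi \subset \hat{\alpha}\psi\urcorner$.

Proof.
*100   $\vdash \ulcorner\phi \supset \psi \,.\supset:\ \alpha \epsilon V \,.\, \phi \,.\supset \psi$
*230, *123   $\supset:\ \alpha \epsilon \hat{\alpha}\phi \,.\supset \psi\urcorner$.   (1)
*101   $\vdash \ulcorner[(\alpha)1 \supset.]\ (\alpha)(\phi \supset \psi) \supset (\alpha)(\alpha \epsilon \hat{\alpha}\phi \,.\supset \psi)$
*313   $\supset.\ \hat{\alpha}\phi \subset \hat{\alpha}\psi\urcorner$.

*315. If $\alpha$ is not free in $\zeta$, $\vdash \ulcorner(\alpha)(\phi \supset.\ \alpha \epsilon \zeta) \supset.\ \hat{\alpha}\phi \subset \zeta\urcorner$.

Proof.
*314   $\vdash \ulcorner(\alpha)(\phi \supset.\ \alpha \epsilon \zeta) \supset.\ \hat{\alpha}\phi \subset \hat{\alpha}(\alpha \epsilon \zeta)$
†189, *224 (& hp)¹   $\supset.\ \hat{\alpha}\phi \subset \zeta\urcorner$.

1 The hypothesis is needed along with †189 to justify $\ulcorner\zeta = \hat{\alpha}(\alpha \epsilon \zeta)\urcorner$.

The conditionals described in *314 do not in general hold as biconditionals; $\ulcorner(\alpha)(\phi \supset \psi)\urcorner$ is stronger than $\ulcorner\hat{\alpha}\phi \subset \hat{\alpha}\psi\urcorner$. Truth of $\ulcorner(\alpha)(\phi \supset \psi)\urcorner$ requires that all entities fulfilling the condition $\phi$ fulfill $\psi$, whereas truth of $\ulcorner\hat{\alpha}\phi \subset \hat{\alpha}\psi\urcorner$ requires merely that all elements fulfilling $\phi$ fulfill $\psi$. Similar reasoning accounts for the conditional character of *315. Note on the other hand that *313 carries the biconditional.

The next two theorems, analogues of the tautologous forms (5) and (6) (§ 11), show that logical products are included in their factors and logical sums include their summands.

†316. $(y)(x) \quad x \cap y \subset x$

Proof.
*100   $z \epsilon x \,.\, z \epsilon y \,.\supset.\ z \epsilon x$   (1)
*315 (& D18)   $[(z)1 \supset]\ 316$

†317. $(y)(x) \quad x \subset x \cup y$

Proof.
*100   $z \epsilon x .\supset:\ z \epsilon x .\vee.\ z \epsilon y$   (1)
*313 (& D19)   $[(z)1 \equiv]\ 317$

The next three theorems, embodying the class version of the law of transposition, are analogues of the tautologous forms (14)-(16).

†318. $(y)(x) \quad x \subset \overline{y} \,.\equiv.\ y \subset \overline{x}$

Proof.
*313 (& D20)   $x \subset \overline{y} \,.\equiv (z)(z \epsilon x .\supset.\ z \,\tilde{\epsilon}\, y)$
*100, *123   $\equiv (z)(z \epsilon y .\supset.\ z \,\tilde{\epsilon}\, x)$
*313 (& D20)   $\equiv.\ y \subset \overline{x}$

†319. $(y)(x) \quad x \subset y \,.\equiv.\ \overline{y} \subset \overline{x}$

Proof.
†275, *224   $x \subset y \,.\equiv.\ x \subset \overline{\overline{y}}$
†318   $\equiv.\ \overline{y} \subset \overline{x}$

†320. $(y)(x) \quad \overline{x} \subset y \,.\equiv.\ \overline{y} \subset x$   Proof similar.

The ‘$x \subset \overline{y}$’ and ‘$\overline{x} \subset y$’ of †318 and †320 amount respectively to saying that the classes $x$ and $y$ are mutually exclusive and that they jointly exhaust all elements. This is apparent from the following two theorems.

†321. $(y)(x) \quad x \subset \overline{y} \,.\equiv.\ x \cap y = \Lambda$

Proof.
*313 (& D20)   $x \subset \overline{y} \,.\equiv (z)(z \epsilon x .\supset.\ z \,\tilde{\epsilon}\, y)$
*100, *123   $\equiv (z)\sim(z \epsilon x \,.\, z \epsilon y)$
†270, *123   $\equiv (z)(z \,\tilde{\epsilon}\, x \cap y)$
†192a   $\equiv.\ x \cap y = \Lambda$

†322. $(y)(x) \quad \overline{x} \subset y \,.\equiv.\ x \cup y = V$

Proof.
*230, *123 (& D20, 21)   $\overline{x} \subset y \,.\equiv (z)(z \epsilon V \,.\, z \,\tilde{\epsilon}\, x \,.\supset.\ z \epsilon y)$
*100, *123   $\equiv (z)(z \epsilon V .\supset:\ z \epsilon x .\vee.\ z \epsilon y)$
*249 (& D19)   $\equiv.\ x \cup y = V$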
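The readings of †321 and †322 as "mutually exclusive" and "jointly exhaustive" can be checked exhaustively in a small model. The sketch assumes a three-element stock of elements `UNIVERSE` in the role of V, with classes as its subsets; `classes`, `comp`, and `exclusion_and_exhaustion_hold` are hypothetical names introduced for the illustration.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1, 2})   # assumed finite stock of elements (the role of V)
LAMBDA = frozenset()

def classes():
    """All subclasses of the assumed universe."""
    items = sorted(UNIVERSE)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def comp(x):
    """Complement relative to the universe."""
    return UNIVERSE - x

def exclusion_and_exhaustion_hold():
    """x included in comp(y) iff x and y share no member;
    comp(x) included in y iff x and y together make up the universe."""
    cs = classes()
    return all(
        ((x <= comp(y)) == (x & y == LAMBDA))
        and ((comp(x) <= y) == (x | y == UNIVERSE))
        for x in cs for y in cs
    )

both_theorems_verified = exclusion_and_exhaustion_hold()
```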

The next two theorems, analogues of the tautologous forms (21) and (22), show ways of paraphrasing an inclusion as an identity.

†323. $(y)(x) \quad x \subset y \,.\equiv.\ x = x \cap y$

Proof.
*100, *123 (& D21)   $x \subset y \,.\equiv (z)(z \epsilon x \,.\equiv.\ z \epsilon x \,.\, z \epsilon y)$
†270, *123 (& D10)   $\equiv.\ x = x \cap y$

†324. $(y)(x) \quad x \subset y \,.\equiv.\ y = x \cup y$   Proof similar.

The following variants are occasionally of use.

†325. $(y)(x) \quad x \subset y \,.\equiv.\ x = (x \cup \overline{y}) \cap y$

Proof. †284, *224   $[323 \equiv]\ 325$

†326. $(y)(x) \quad x \subset y \,.\equiv.\ y = (\overline{x} \cap y) \cup x$

Proof. †285, *224   $[324 \equiv]\ 326$

†327. $(y)(x) \quad x \subset \overline{y} \,.\equiv.\ y = (y \cup x) \cap \overline{x}$

Proof.
†318   $x \subset \overline{y} \,.\equiv.\ y \subset \overline{x}$
†325   $\equiv.\ y = (y \cup \overline{\overline{x}}) \cap \overline{x}$
†275, *224   $\equiv.\ y = (y \cup x) \cap \overline{x}$

†328. $(y)(x) \quad x \subset \overline{y} \,.\equiv.\ x = (y \cup x) \cap \overline{y}$

Proof similar, minus the first step.

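The next four are analogues of the tautologous forms (28), (29), (42), and (47).

†329. $(z)(y)(x) \quad x \subset y \,.\supset.\ x \cap z \subset y \cap z$

Proof.
*100   $w \epsilon x .\supset.\ w \epsilon y :\supset:\ w \epsilon x \,.\, w \epsilon z \,.\supset.\ w \epsilon y \,.\, w \epsilon z$
†270, *123   $\supset:\ w \epsilon x \cap z \,.\supset.\ w \epsilon y \cap z$   (1)
*101 (& D21)   $[(w)1 \supset]\ 329$

†330. $(z)(y)(x) \quad x \subset y \,.\supset.\ x \cup z \subset y \cup z$   Proof similar.

†331. $(z)(y)(x) \quad x \subset y \cap z \,.\equiv.\ x \subset y \,.\, x \subset z$

Proof.
†270, *123 (& D21)   $x \subset y \cap z \,.\equiv (w)(w \epsilon x .\supset.\ w \epsilon y \,.\, w \epsilon z)$
*100, *123   $\equiv (w)(w \epsilon x .\supset.\ w \epsilon y :\ w \epsilon x .\supset.\ w \epsilon z)$
*140 (& D21)   $\equiv.\ x \subset y \,.\, x \subset z$

†332. $(z)(y)(x) \quad x \cup y \subset z \,.\equiv.\ x \subset z \,.\, y \subset z$   Proof similar.

Note, on the other hand, that the analogues:

$(z)(y)(x) \quad x \subset y \cup z \,.\equiv:\ x \subset y .\vee.\ x \subset z$,

$(z)(y)(x) \quad x \cap y \subset z \,.\equiv:\ x \subset z .\vee.\ y \subset z$

of (43) and (46) are false.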
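Concrete counterexamples to the two false analogues are easy to exhibit. The sketch below uses small invented classes of numbers, with Python's `<=` standing in for inclusion; the variable names are hypothetical.

```python
# Counterexample to the analogue of (43):
# x is included in the sum of y and z without being included in either.
x1, y1, z1 = frozenset({1, 2}), frozenset({1}), frozenset({2})
left_43 = x1 <= (y1 | z1)
right_43 = (x1 <= y1) or (x1 <= z1)

# Counterexample to the analogue of (46):
# the product of x and y is included in z though neither factor is.
x2, y2, z2 = frozenset({1}), frozenset({2}), frozenset()
left_46 = (x2 & y2) <= z2
right_46 = (x2 <= z2) or (y2 <= z2)
```

In each case the left side comes out true and the right side false, so the biconditional fails.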

$V$ and $\Lambda$ have the respective peculiarities of including and being included in every class.

†333. $(x) \quad x \subset V$   Proof: †190 (& D21).

†334. $(x) \quad \Lambda \subset x$

Proof. †323   $334\ [\equiv 298]$

Thence we have two corollaries.

†335. $(x) \quad V \subset x \,.\equiv.\ x = V$

Proof. †312   $[333\ .]\ V \subset x \,.\equiv.\ x = V$

†336. $(x) \quad x \subset \Lambda \,.\equiv.\ x = \Lambda$

Proof. †312   $x \subset \Lambda\ [.\ 334]\ .\equiv.\ x = \Lambda$
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§35 

Although many of the classes included in a class x may be members
of x, the following theorem shows that they cannot all be members.

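†337. $(x)(\exists y)(y \subset x . y \tilde{\epsilon} x)$

Proof. *100         $z \epsilon x . z \tilde{\epsilon} z .\supset. z \epsilon x$   (1)
       *315         $[(z)1 \supset.]\ \hat{z}(z \epsilon x . z \tilde{\epsilon} z) \subset x$   (2)
       *235         $L2 \epsilon L2 .\equiv. L2 \epsilon V . L2 \epsilon x . L2 \tilde{\epsilon} L2$
       †191, *123   $\equiv. L2 \epsilon x . L2 \tilde{\epsilon} L2$   (3)
       *100         $[3 \supset.]\ L2 \tilde{\epsilon} x$   (4)
       *232         $[2 . 4 .\supset]\ 337$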



$\Lambda$, for example, is included in $\Lambda$ but is not a member of $\Lambda$; again
any non-element is included in V but is not a member of V.


The sign ‘$\supset$’ was used for inclusion by Gergonne as far back as 1816; Russell
has reversed it to distinguish it from the conditional sign. Most of the theorems
of the present section were known to Peirce and Schröder.

The distinction between inclusion and membership became fully clear only with 
the advent of quantification theory; it was clear to Frege in 1879, to Peano in 1889, 
to Peirce in 1885 (3.396). But in a tentative form the distinction existed in
traditional logic as a distinction between “distributive” and “collective predication,”
drawn to resolve the fallacies of composition and division (e.g. Peter is an Apostle, 
the Apostles are twelve, therefore Peter is twelve). 


§ 35. Unit Classes

THE CLASS whose sole member is x is called the unit class of
x, and designated by the abbreviated symbolism ‘$\iota x$’.¹

D22. $\ulcorner \iota\zeta \urcorner$ for $\ulcorner \hat{\alpha}(\alpha = \zeta) \urcorner$.

The first theorem shows that x is a member of y if and only if $\iota x$
and y are not mutually exclusive. Two corollaries follow it.

†340.² $(y)(x)\ x \epsilon y .\equiv. \iota x \cap y \neq \Lambda$

Proof. *234b                $x \epsilon y .\equiv (\exists z)(z = x . z \epsilon y)$
       †191, *123           $\equiv (\exists z)(z = x . z \epsilon V . z \epsilon y)$
       *230, *123 (& D22)   $\equiv (\exists z)(z \epsilon \iota x . z \epsilon y)$
       †270, *123           $\equiv (\exists z)(z \epsilon \iota x \cap y)$
       †192b                $\equiv. \iota x \cap y \neq \Lambda$

¹ The notation is Peano’s; ‘ι’ is for ‘ἴσος’. The distinction between $\iota x$ and x
seems first to have been drawn explicitly by Frege in 1893 and Peano in 1894.
See end of § 25.

² Note that $\iota x \cap y$ is the product of $\iota x$ and y, whereas $\iota(x \cap y)$ is the unit class

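†341. $(y)(x)\ x \tilde{\epsilon} y .\equiv. \iota x \subset \bar{y}$

Proof. *100   $[340 \equiv:]\ x \tilde{\epsilon} y .\equiv. \iota x \cap y = \Lambda$
       †321   $\equiv. \iota x \subset \bar{y}$

†342. $(y)(x)\ x \epsilon y .\equiv. x \epsilon V . \iota x \subset y$

Proof. *230 (& D20)   $x \epsilon \bar{y} .\equiv. x \epsilon V . x \tilde{\epsilon} y$
       †341, *123     $\equiv. x \epsilon V . \iota x \subset \bar{y}$   (1)
       †275, *224     $342\ [\equiv 1]$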

If x is not an element, and hence a member of no class, there is
of course no class whose sole member is x. In this trivial case,
then, the $\iota x$ defined in D22 must turn out to be something else;
and it turns out in fact to be $\Lambda$.

†343. $(x)\ x \tilde{\epsilon} V .\equiv. \iota x = \Lambda$

Proof. †341         $x \tilde{\epsilon} V .\equiv. \iota x \subset \bar{V}$
       †293, *224   $\equiv. \iota x \subset \Lambda$
       †336         $\equiv. \iota x = \Lambda$

If on the other hand x is capable of membership at all, $\iota x$ has x
as member.

†344. $(x)\ x \epsilon V .\equiv. x \epsilon \iota x$

Proof. *235 (& D22)   $x \epsilon V\ [. 182] .\equiv. x \epsilon \iota x$

Membership in $\iota x$ involves not only identity with x, but elementhood
of x.

†345. $(y)(x)\ y \epsilon \iota x .\equiv. x \epsilon V . x = y$

Proof. *230 (& D22)   $y \epsilon \iota x .\equiv. y \epsilon V . y = x$
       *228           $\equiv. x \epsilon V . x = y$

The composite connective ‘$\epsilon\iota$’ thus expresses element-identity;
it is stronger than ‘=’. Its commutativity is seen as follows.


of $x \cap y$. Just as ‘~’ is understood as governing the shortest statement following
it (cf. §§ 2, 7), so ‘ι’ is to be understood always as governing the shortest term
following it; and similarly for any other singulary operator.

†346. $(y)(x)\ x \epsilon \iota y .\equiv. y \epsilon \iota x$

Proof. *230 (& D22)   $x \epsilon \iota y .\equiv. x \epsilon V . x = y$
       †345           $\equiv. y \epsilon \iota x$

The next theorem shows that $\iota x$ is the class whose sole member is
x if and only if x is an element.

†347. $(x)\ (y)(y \epsilon \iota x .\equiv. x = y) \equiv. x \epsilon V$

Proof. *100, *123   $L347 \equiv (y)([345 \equiv:]\ x = y .\supset. x \epsilon V)$
       *234a        $\equiv. x \epsilon V$

The following theorem, an extension of the foregoing one, shows
that $\iota x \cup \iota y$ is the class whose sole members are x and y if and
only if x and y are elements.

†348. $(y)(x)\ (z)(z \epsilon \iota x \cup \iota y .\equiv: z = x .\lor. z = y) \equiv. x, y \epsilon V$

Proof.¹
†271, *123           $L348 \equiv (z)(z \epsilon \iota x .\lor. z \epsilon \iota y :\equiv: z = x .\lor. z = y)$
*235, *123 (& D22)   $\equiv (z)(z \epsilon V . z = x .\lor. z \epsilon V . z = y :\equiv: z = x .\lor. z = y)$
*100, *123           $\equiv (z)(z \epsilon V : z = x .\lor. z = y :\equiv: z = x .\lor. z = y)$
*100, *123           $\equiv (z)(z = x .\lor. z = y .\supset. z \epsilon V)$
*100, *123           $\equiv (z)(z = x .\supset. z \epsilon V : z = y .\supset. z \epsilon V)$
*140                 $\equiv. (z)(z = x .\supset. z \epsilon V) . (z)(z = y .\supset. z \epsilon V)$
*234a, *123          $\equiv. x, y \epsilon V$

The next two theorems show that the words ‘y is the class whose
sole member is x’ can be put into symbols in three ways:

$(z)(z \epsilon y .\equiv. z = x)$,    $y = \iota x . x \epsilon V$,    $y \subset \iota x . x \epsilon y$.

†349. $(y)(x)\ (z)(z \epsilon y .\equiv. z = x) \equiv. y = \iota x . x \epsilon V$

Proof. †189           $y = \hat{z}(z \epsilon y)$   (1)
       *121 (& D22)   $L349 \supset: [1 \equiv.]\ y = \iota x$   (2)
       *100           $[2 \supset:]\ L349 .\equiv. y = \iota x . L349$
       *228           $\equiv. y = \iota x . (z)(z \epsilon \iota x .\equiv. z = x)$
       †347, *123     $\equiv. y = \iota x . x \epsilon V$

†350. $(y)(x)\ (z)(z \epsilon y .\equiv. z = x) \equiv. y \subset \iota x . x \epsilon y$

Proof. †312, *123   $[349 \equiv:]\ L350 \equiv. y \subset \iota x . \iota x \subset y . x \epsilon V$
       †342, *123   $\equiv. y \subset \iota x . x \epsilon y$

¹ This proof, neater than my own, is due to N. D. Gautam of Herbert College in
India.

A peculiarity of $\iota x$ is that it is included in any class which it
overlaps at all.

†351. $(y)(x)\ \iota x \subset y .\lor. \iota x \cap y = \Lambda$

Proof. *100   $[342 \supset.]\ x \epsilon y .\supset. \iota x \subset y$   (1)
       *100   $[1 . 340 .\supset]\ 351$

In other words, $\iota x$ is included in each class or its complement.

†352. $(y)(x)\ \iota x \subset y .\lor. \iota x \subset \bar{y}$

Proof. †321, *123   $352\ [\equiv 351]$

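Inclusion of $\iota x$ is analyzable as follows.

†353. $(y)(x)\ \iota x \subset y .\equiv: x \epsilon V .\supset. x \epsilon y$

Proof. †345, *123 (& D21)   $\iota x \subset y .\equiv (z)(x \epsilon V . x = z .\supset. z \epsilon y)$
       *100, *123           $\equiv (z)(x = z .\supset: x \epsilon V .\supset. z \epsilon y)$
       *234a                $\equiv: x \epsilon V .\supset. x \epsilon y$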

Two consequences follow. 

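†354. $(z)(y)(x)(w)\ x \epsilon V . \iota x \cup y = \iota z \cup \iota w .\supset: x = z .\lor. x = w$

Proof. †317                 $\iota x \subset \iota x \cup y$   (1)
       *223                 $\iota x \cup y = \iota z \cup \iota w .\supset: [1 \equiv.]\ \iota x \subset \iota z \cup \iota w$
       †353                 $\supset: x \epsilon V .\supset. x \epsilon \iota z \cup \iota w$   (2)
       *100                 $[2 \supset.]\ L354 .\supset. x \epsilon \iota z \cup \iota w$
       †271                 $\supset: x \epsilon \iota z .\lor. x \epsilon \iota w$
       *230, *123 (& D22)   $\supset: x \epsilon V . x = z .\lor. x \epsilon V . x = w$
       *100                 $\supset: x = z .\lor. x = w$

†355. $(z)(y)(x)\ x \epsilon V . \iota x \cup y = \iota z .\supset. x = z$

Proof. †277, *224   $L355 \supset. x \epsilon V . \iota x \cup y = \iota z \cup \iota z$
       †354         $\supset: x = z .\lor. x = z$
       *100         $\supset. x = z$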

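The next theorem shows that $\iota x$ has one member at most.

†356. $(z)(y)(x)\ y, z \epsilon \iota x .\supset. y = z$

Proof. †345, *123   $y, z \epsilon \iota x .\supset. x \epsilon V . x = y . x \epsilon V . x = z$
       *100         $\supset. x = y . x = z$
       *100         $\supset: [184 \supset.]\ y = z$

The next shows that $\iota x$ includes two classes at most.

†357. $(y)(x)\ y \subset \iota x .\equiv: y = \iota x .\lor. y = \Lambda$

Proof. *223         $\iota x \cap y = y .\supset:. \iota x \cap y = \iota x .\lor. \iota x \cap y = \Lambda :\equiv R357$   (1)
       †323, *123   $y \subset \iota x .\supset. [351 \equiv]\ R357\ [:\equiv 1]$   (2)
       †312         $y = \iota x .\supset. y \subset \iota x . \iota x \subset y$
       *100         $\supset. y \subset \iota x$   (3)
       †334         $\Lambda \subset \iota x$   (4)
       *223         $y = \Lambda .\supset: y \subset \iota x\ [.\equiv 4]$   (5)
       *100         $[3 . 5 .\supset.]\ R357 .\supset. y \subset \iota x$   (6)
       *100         $[2 . 6 .\supset]\ 357$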
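
As a concrete check on the two preceding results, the following sketch enumerates the classes included in a unit class and tests †351 on a few sample classes. It is only an illustration under a simplifying assumption: every Python value counts as an element, so the non-element case of D22 is not modelled. The helper names are hypothetical.

```python
from itertools import combinations

def subclasses(s):
    """Return every subset of the finite set s, each as a frozenset."""
    items = list(s)
    return {frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)}

unit_x = frozenset({"x"})        # stands in for the unit class of x
included = subclasses(unit_x)    # every class included in it (cf. †357)

def included_or_disjoint(u, y):
    """†351: u is included in y, or u and y have no common member."""
    return u <= y or not (u & y)

samples = [frozenset(), frozenset({"x"}),
           frozenset({"x", "a"}), frozenset({"a", "b"})]
checks_351 = [included_or_disjoint(unit_x, y) for y in samples]

print(included, checks_351)
```

The enumeration yields just the unit class itself and the null class, in line with †357.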


The next shows that distinct elements have distinct unit classes. 

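†358. $(y)(x)\ x \epsilon V .\supset: \iota x = \iota y .\equiv. x = y$

Proof. *226                 $x = y .\supset. \iota x = \iota y$   (1)
       *223                 $\iota x = \iota y .\supset:. [344 \equiv:]\ x \epsilon V .\equiv. x \epsilon \iota y$
       *230 (& D22), *123   $\supset: x \epsilon V .\equiv. x \epsilon V . x = y$   (2)
       *100                 $x \epsilon V .\supset:. [2 \supset:]\ \iota x = \iota y .\supset. x = y$
       *100                 $\supset: [1 \supset:]\ \iota x = \iota y .\equiv. x = y$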

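Regardless of whether x is an element, $\iota x$ proves to be an
element.

†359. $(x)\ \iota x \epsilon V$

Proof. Since ‘$(z)(z \epsilon y .\equiv. z \epsilon x)$’ is stratified,
       *200                     $x \epsilon V .\supset. \hat{y}(z)(z \epsilon V .\supset: z \epsilon y .\equiv. z \epsilon x) \epsilon V$
       *100, *123               $\supset. \hat{y}(z)(z \epsilon V . z \epsilon y .\equiv. z \epsilon V . z \epsilon x) \epsilon V$
       †191, *123 (& D10, 22)   $\supset. \iota x \epsilon V$   (1)
       *223                     $\iota x = \Lambda .\supset: \iota x \epsilon V\ [.\equiv 211]$   (2)
       *100                     $[1 . 2 . 343 .\supset]\ 359$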


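†360. $(y)(x)\ \iota x \cup \iota y \epsilon V$

Proof. †359   $\iota y \epsilon V$   (1)
       †273   $[359 . 1 .\supset]\ 360$

†361. $(y)(x)\ \iota x \cap y \epsilon V$

Proof. †323   $\iota x \subset y .\supset. \iota x = \iota x \cap y$
       *223   $\supset: [359 \equiv.]\ \iota x \cap y \epsilon V$   (1)
       *223   $\iota x \cap y = \Lambda .\supset: \iota x \cap y \epsilon V\ [.\equiv 211]$   (2)
       *100   $[1 . 2 . 351 .\supset]\ 361$

In ‘$y \cap \overline{\iota x}$’ we have a notation for the class of all members of y
except x.

†362. $(z)(y)(x)\ z \epsilon y \cap \overline{\iota x} .\equiv. z \epsilon y . z \neq x$

Proof. *245 (& D20, 22)   $\overline{\iota x} = \hat{z}(z \neq x)$   (1)
       *223               $[1 \supset:]\ z \epsilon y \cap \overline{\iota x} .\equiv. z \epsilon y \cap \hat{z}(z \neq x)$
       †270               $\equiv. z \epsilon y . z \epsilon \hat{z}(z \neq x)$
       *230, *123         $\equiv. z \epsilon y . z \epsilon V . z \neq x$
       †191, *123         $\equiv. z \epsilon y . z \neq x$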

That class turns out to be an element if and only if y itself is. 


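†363. $(y)(x)\ y \epsilon V .\equiv. y \cap \overline{\iota x} \epsilon V$

Proof. †274   $[359 \equiv.]\ \overline{\iota x} \epsilon V$   (1)
       †272   $y \epsilon V\ [. 1] .\supset. y \cap \overline{\iota x} \epsilon V$   (2)
       †273   $y \cap \overline{\iota x} \epsilon V\ [. 359] .\supset. (y \cap \overline{\iota x}) \cup \iota x \epsilon V$   (3)
       †326   $\iota x \subset y .\supset. y = (y \cap \overline{\iota x}) \cup \iota x$
       *223   $\supset:. y \cap \overline{\iota x} \epsilon V .\supset. y \epsilon V\ [:\equiv 3]$
       *100   $\supset. [2 \supset]\ 363$   (4)
       †318   $\iota x \subset \bar{y} .\supset. y \subset \overline{\iota x}$
       †323   $\supset. y = y \cap \overline{\iota x}$
       *223   $\supset 363$   (5)
       *100   $[4 . 5 . 352 .\supset]\ 363$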



From †363 it is apparent that every non-element differs from
every element in point of infinitely many members; for if we
could reach z from an element w by n steps of deleting or inserting
single members, i.e. by n transformations of the type “y to $y \cap \overline{\iota x}$”
or “$y \cap \overline{\iota x}$ to y”, then we could establish elementhood of z by n
applications of †363. In particular it follows that every non-element
differs from both $\Lambda$ and V in point of infinitely many
members; hence every finite class is an element, and every class
which is so large as to exclude only a finite number of elements is
an element.



CHAPTER FIVE 


RELATIONS 




§ 36. Pairs and Relations 


IT IS natural to regard any condition on one variable as determining
a class: the class of all entities x satisfying the given condition.
Correspondingly it is natural to regard any condition on
two variables as determining a relation: the relation of any entity
x to any entity y such that x and y satisfy the given condition.
We have seen (§ 24), however, that this point of view is untenable
in the case of classes; and the same is readily seen in the case of
relations. For, suppose z is the relation of x to y such that x and y
satisfy the condition ‘x does not bear the relation x to y’. I.e.,

(1) $(y)(x)$ x bears z to y $.\equiv {\sim}$(x bears x to y).

Thence we have the self-contradictory consequence:

(2) $(y)$ z bears z to y $.\equiv {\sim}$(z bears z to y).¹

In the case of classes the analogous difficulty was obviated by
barring certain entities, the so-called non-elements, from membership;
in other words, by countenancing as classes only classes
of so-called elements. A similar expedient suffices here: countenancing
as relations only relations of elements. Instead of asking
in general that there be such a thing as the relation of any entity
x to any entity y such that ... x ... y ... , then, we ask only that
there be such a thing as the relation of any element x to any element
y such that ... x ... y .... The difficulty encountered in (2) now
disappears; for, where z is the relation of any element x to any
element y such that ~(x bears x to y), we have:

(3) $(y)(x)$ x bears z to y $.\equiv.$ x, y $\epsilon$ V . ~(x bears x to y)

rather than (1). From (3) we obtain no contradiction, but only
the harmless consequence:

$(y)$ z bears z to y $.\equiv.$ z, y $\epsilon$ V . ~(z bears z to y),

and the corollary ‘$z \tilde{\epsilon} V$’.
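
The diagonal character of the argument from (1) to (2) can be checked by brute force on a tiny universe. The sketch below is an illustration only, under assumptions that are not Quine's: the universe has two members, each member is arbitrarily assigned a relation over the universe, and relations are modelled as sets of Python tuples. All names are hypothetical.

```python
from itertools import product

U = (0, 1)
PAIRS = [(x, y) for x in U for y in U]

# Every relation over U, each as a frozenset of ordered pairs.
RELATIONS = [frozenset(p for p, keep in zip(PAIRS, bits) if keep)
             for bits in product([False, True], repeat=len(PAIRS))]

def satisfies_condition_1(z, rel):
    """Does rel[z] relate x to y exactly when x does not bear rel[x] to y?"""
    return all(((x, y) in rel[z]) == ((x, y) not in rel[x])
               for x, y in PAIRS)

# rel assigns a relation to each member of U; try every assignment and
# look for a member z whose relation meets condition (1).
witnesses = [(z, rel)
             for rel in product(RELATIONS, repeat=len(U))
             for z in U
             if satisfies_condition_1(z, rel)]

print(len(RELATIONS), witnesses)
```

No assignment yields a witness, because taking x as z in (1) demands that a pair belong to the relation exactly when it does not, which is the content of (2).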

¹ Pointed out by Russell, Principles, Appendix A.

The new abstraction prefix ‘the relation of any element x to any
element y such that’ will subsequently be defined, and the appropriate
laws will be established (§ 37); but first we must look to the
general notion of relation. It is characteristic of a relation to pair 
elements off with elements according to one or another plan. 
The numerical relation less than, e.g., pairs 0 off with 1, also it 
pairs 0 off with 2, also it pairs 1 off with 2, and in general it pairs 
x off with y wherever x < y. The relation father of, again, pairs
Abraham off with Isaac, and in general it pairs x off with y wherever
x is father of y. It is thus convenient to think of each relation
as a class of pairs of elements. To say that an element x bears a 
relation z to an element y is to say, then, that x paired with y 
forms a member of z. 

This notion of pair now needs explicit formulation. It will be 
rendered symbolically in the fashion ‘x;y’, read ‘x paired with y’;
and we have now to frame a satisfactory definition of this notation. 
Since relations are to be classes of pairs of elements, these pairs 
must themselves be elements; otherwise we could not have classes 
of them. Here, then, is one demand which our definition must 
meet. Further, pairs must be conceived in a non-commutative 
fashion, as ordered pairs; they must be distinguished not only 
when they differ as to their constituent elements, but when they 
are formed from the same elements in reverse order. We must 
distinguish e.g. between Abraham paired with Isaac and Isaac 
paired with Abraham, for the former pair is wanted as a member of 
father of whereas the latter pair is not. But any notion of pair 
which meets these demands will serve our purpose. Now it will be 
shown that the demands are met by the following definition. 

D23. $\ulcorner (\zeta;\eta) \urcorner$ for $\ulcorner (\iota\iota\zeta \cup \iota(\iota\zeta \cup \iota\eta)) \urcorner$.

According to this definition, x;y is the class whose sole members
are $\iota x$ and $\iota x \cup \iota y$.

†410. $(z)(y)(x)\ z \epsilon x;y .\equiv: z = \iota x .\lor. z = \iota x \cup \iota y$

Proof. †348 (& D23)   $(z)410\ [\equiv. 359 . 360]$

These members $\iota x$ and $\iota x \cup \iota y$ of x;y are respectively the class
whose sole member is x and the class whose sole members are x
and y, in case x and y are elements (cf. †347, †348).
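
D23 can be tried out directly on finite data. The following is a minimal sketch, assuming hashable Python values as elements and `frozenset` as the stand-in for classes; the function names `unit` and `pair` are hypothetical. It also checks, over a small domain, the ordering property that †417 establishes below.

```python
from itertools import product

def unit(x):
    """Unit class of x, for hashable Python values."""
    return frozenset({x})

def pair(x, y):
    """D23 read literally: the class whose members are ιx and ιx ∪ ιy."""
    return unit(unit(x)) | unit(unit(x) | unit(y))

domain = range(3)
# Pairs coincide exactly when their components coincide in order.
law_417 = all((pair(a, b) == pair(c, d)) == (a == c and b == d)
              for a, b, c, d in product(domain, repeat=4))

print(pair(1, 2), pair(1, 1), law_417)
```

Note that `pair(1, 1)` collapses to a class with the single member ι1, since ι1 ∪ ι1 is ι1; the ordering property is unaffected.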

The first of our demands on x;y, elementhood, turns out to be
met independently of whether x and y are elements.

†411. $(y)(x)\ x;y \epsilon V$   Proof: †360 (& D23).

The other demand was that pairs of elements x;y and z;w always
be distinct except where x and y are the same respectively as z and w.
†417, which shows that this demand is met, will be reached after
five preliminary theorems.

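†412. $(z)(y)(x)(w)\ x;y = z;w .\supset: \iota x = \iota z .\lor. \iota x = \iota z \cup \iota w$

Proof. †182         $\iota x = \iota x$   (1)
       *100         $[1 \supset:]\ \iota x = \iota x .\lor. \iota x = \iota x \cup \iota y$   (2)
       *223         $x;y = z;w .\supset: \iota x \epsilon x;y .\equiv. \iota x \epsilon z;w$
       †410, *123   $\supset. [2 \equiv]\ R412$

†413. $(z)(y)(x)(w)\ x;y = z;w .\supset: \iota x \cup \iota y = \iota z .\lor. \iota x \cup \iota y = \iota z \cup \iota w$
Proof similar.

†414. $(z)(y)(x)(w)\ x \epsilon V . x;y = z;w .\supset. x = z$

Proof. †358   $x \epsilon V .\supset: \iota z = \iota x .\equiv. x = z$   (1)
       †412   $x;y = z;w .\supset: \iota z = \iota x .\lor. \iota z = \iota x \cup \iota y$   (2)
       †355   $x \epsilon V . \iota z = \iota x \cup \iota y .\supset. x = z$   (3)
       *100   $[1 . 2 . 3 .\supset]\ 414$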

Hitherto the use of *100 was restricted to cases where the complexity
of the relevant tautologous form did not exceed a certain
arbitrary limit (cf. § 17). Hereafter, for brevity, this restriction
will be waived; it is already waived, indeed, in the last step of the
above proof. The reader has observed the use of *100 through so
many proofs that he will now usually be able, even in these more
complex cases, to recognize tautology on reasonably brief examination;
and when there is doubt the method of tables is always
available.

†415. $(z)(y)(x)(w)\ y \epsilon V . x;y = z;w .\supset: y = z .\lor. y = w$

Proof. †355   $y \epsilon V . \iota x \cup \iota y = \iota z .\supset. y = z$   (1)
       †354   $y \epsilon V . \iota x \cup \iota y = \iota z \cup \iota w .\supset: y = z .\lor. y = w$   (2)
       *100   $[413 . 1 . 2 .\supset]\ 415$

†416. $(z)(y)(x)(w)\ x, y, w \epsilon V . x;y = z;w .\supset. y = w$

Proof. †415   $w \epsilon V . x;y = z;w .\supset: w = x .\lor. y = w$   (1)
       *223   $w = x .\supset:. L414 \supset. w = z\ [:\equiv 414]$   (2)
       *223   $w = z .\supset:: L415 \supset: y = w .\lor. y = w\ [.:\equiv 415]$   (3)
       *100   $[1 . 2 . 3 .\supset]\ 416$

†417. $(z)(y)(x)(w)\ x, y, w \epsilon V .\supset: x;y = z;w .\equiv. x = z . y = w$

Proof. *226   $y = w .\supset. x;y = x;w$   (1)
       *223   $x = z .\supset:. [1 \equiv:]\ y = w .\supset. x;y = z;w$   (2)
       *100   $[414 . 416 . 2 .\supset]\ 417$

Two minor variants of †417 follow as corollaries.

†418. $(z)(y)(x)(w)\ x, y \epsilon V .\supset: w \epsilon V . x;y = z;w .\equiv. x = z . y = w$

Proof. *223   $y = w .\supset: y \epsilon V .\equiv. w \epsilon V$   (1)
       *100   $[417 . 1 .\supset]\ 418$

†419. $(z)(y)(x)(w)\ x, y \epsilon V .\supset: z, w \epsilon V . x;y = z;w .\equiv. x = z . y = w$

Proof. *223   $x = z .\supset: x \epsilon V .\equiv. z \epsilon V$   (1)
       *100   $[418 . 1 .\supset]\ 419$


Where x is a relation, the statement that y bears x to z may
conveniently be rendered in symbols as ‘x(y,z)’. The way to
define this notation is clear from foregoing considerations: it is to
mean that y and z are elements whose pair y;z is a member of x.

D24. $\ulcorner \zeta(\eta,\theta) \urcorner$ for $\ulcorner (\eta, \theta \epsilon V . \eta;\theta \epsilon \zeta) \urcorner$.

(The outer parentheses will be retained only after a circumflex.)

Thus ‘x(y,z)’ is short for ‘$y, z \epsilon V . y;z \epsilon x$’. It may at first
appear that the clause ‘$y, z \epsilon V$’ to the effect that y and z are
elements is superfluous here, since ‘$y;z \epsilon x$’ already implies elementhood
of y;z. But actually the clause is not superfluous, for y;z is
an element even when y and z are not (cf. †411). Where y is not
an element, we readily see from †343 and †297 that y;z reduces to
$\iota\Lambda \cup \iota\iota z$; yet from ‘$\iota\Lambda \cup \iota\iota z \epsilon x$’ we should not want to infer
that every non-element y bears x to z, even though it is indeed
trivially the case that $y;z \epsilon x$. Again, where z is not an element,
y;z reduces to $\iota\iota y$; yet from ‘$\iota\iota y \epsilon x$’ we should not want to infer
that y bears x to every non-element z. Again, where neither y nor
z is an element, y;z reduces to $\iota\Lambda$; yet from ‘$\iota\Lambda \epsilon x$’ we should not
want to infer that every non-element y bears x to every non-element
z. Trivial results are thus excluded by incorporating
‘$y, z \epsilon V$’ into the definition of ‘x(y,z)’.

†420. $(y)(x)\ V(x,y) \equiv. x, y \epsilon V$

Proof. *100 (& D24)   $V(x,y) \equiv. x, y \epsilon V\ [. 411]$

†421. $(z)(y)(x)(w)\ x \epsilon z . y \epsilon w .\supset V(x,y)$

Proof. †190   $x \epsilon z .\supset. x \epsilon V$   (1)
       †190   $y \epsilon w .\supset. y \epsilon V$   (2)
       *100   $[1 . 2 . 420 .\supset]\ 421$

†422. $(z)(y)(x)(w)\ x \subset y .\supset. x(z,w) \supset y(z,w)$

Proof. *231 (& D21)   $x \subset y .\supset: z;w \epsilon x .\supset. z;w \epsilon y$
       *100 (& D24)   $\supset. x(z,w) \supset y(z,w)$

†423. $(z)(y)(x)(w)\ (x \cap y)(z,w) \equiv. x(z,w) . y(z,w)$

Proof. †270, *123 (& D24)   $(x \cap y)(z,w) \equiv. z, w \epsilon V . z;w \epsilon x . z;w \epsilon y$
       *100 (& D24)         $\equiv. x(z,w) . y(z,w)$

†424. $(z)(y)(x)(w)\ (x \cup y)(z,w) \equiv. x(z,w) \lor y(z,w)$

Proof similar.

Relations in the sense here considered are known, more particularly,
as dyadic relations; they relate elements in pairs. The
relation of giving (y gives z to w) or betweenness (y is between z
and w), on the other hand, is triadic; and the relation of paying
(x pays y to z for w) is tetradic. But the theory of dyadic relations
provides a convenient basis for the treatment also of such polyadic
cases. A triadic relation among elements y, z, and w might be
conceived as a dyadic relation borne by y to z;w. Actually there is
some advantage in conceiving it rather as a dyadic relation borne
by $\iota\iota y$ to z;w, for this proves to simplify the conditions of elementhood
for triadic relations. The analogue of D24 for triadic relations
would therefore assume this form:

$\ulcorner \zeta(\eta, \theta, \theta') \urcorner$ for $\ulcorner (\eta, \theta, \theta' \epsilon V . \zeta(\iota\iota\eta, \theta;\theta')) \urcorner$.

Tetradic relations could be handled on the basis of triadic ones in
similar fashion:

$\ulcorner \zeta(\eta, \eta', \theta, \theta') \urcorner$ for $\ulcorner (\eta, \eta', \theta, \theta' \epsilon V . \zeta(\iota\iota\eta, \iota\iota\eta', \theta;\theta')) \urcorner$.

Similarly for pentadic relations, hexadic ones, and so on. However,
we shall have no occasion to proceed beyond the dyadic case.


The notion of a relation as a class of ordered pairs goes back to Peirce (3.220,
329), and the notation ‘x;y’ to Frege and Peano. But Wiener (1914) was the first
to show that the ordered pair could be defined within the theory of classes. He took
x;y as $\iota(\iota\iota x \cup \iota\Lambda) \cup \iota\iota\iota y$, though if it had not been for Russell’s theory of types he
would obviously have adopted the simpler version $\iota(\iota x \cup \iota\Lambda) \cup \iota\iota y$. The alternative
version adopted in D23 is due to Kuratowski. In the present system simplicity
would have been gained in some directions, and lost in others, by construing x;y in
still a third way: as $\hat{z}(\exists w)(w \epsilon x . z = \iota w .\lor. w \epsilon y . z = \overline{\iota w})$. This version has the
advantage of rendering the antecedent ‘$x, y, w \epsilon V$’ superfluous in †417.


§ 37. Abstraction of Relations 

JUST AS the prefix ‘$\hat{x}$’ of class abstraction corresponds to the
words ‘the class of all elements x such that’, so the prefix ‘$\hat{x}\hat{y}$’ of
relational abstraction is to correspond to the words ‘the relation
of any element x to any element y such that’.¹ The relation factor
of, e.g., can be expressed in terms of relational abstraction as

$\hat{x}\hat{y}(\exists z)(z \epsilon \text{integer} . x \times z = y)$.
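
Read extensionally, the abstract for factor of is just a set comprehension over pairs. The sketch below is illustrative only and makes two assumptions not in the text: the integers are cut down to the finite range 1 to 12, and Python 2-tuples stand in for the pairs x;y.

```python
N = range(1, 13)   # a finite stand-in for the integers (assumption)

# The relation of x to y such that, for some z in N, x times z is y.
factor_of = {(x, y) for x in N for y in N
             if any(x * z == y for z in N)}

print(sorted(p for p in factor_of if p[1] == 12))
```

Restricting z to the same range loses nothing here, since any z with x × z = y for x and y in the range must itself lie in the range.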

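The form of notation ‘$\hat{x}\hat{y}(\ldots x \ldots y \ldots)$’ is readily defined if we
reflect that this relation is to comprise as members just those pairs
x;y of elements such that ... x ... y .... Anything z is to belong
to $\hat{x}\hat{y}(\ldots x \ldots y \ldots)$ just in case

$(\exists x)(\exists y)(x, y \epsilon V . z = x;y . \ldots x \ldots y \ldots)$.

The definition adopted is thus the following.

D25. $\ulcorner \hat{\alpha}\hat{\beta}\phi \urcorner$ for $\ulcorner \hat{\gamma}(\exists\alpha)(\exists\beta)(\alpha, \beta \epsilon V . \gamma = \alpha;\beta . \phi) \urcorner$.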

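Accordingly we have the following metatheorem.

*430. If α and β are free in neither ζ nor η,
$\vdash \ulcorner \hat{\alpha}\hat{\beta}\phi\,(\zeta,\eta) \equiv. \zeta, \eta \epsilon V . (\exists\alpha)(\exists\beta)(\alpha = \zeta . \beta = \eta . \phi) \urcorner$.

Proof. †411                $\vdash \ulcorner \zeta;\eta \epsilon V \urcorner$   (1)
       *235 (& D25 & hp)   $\vdash \ulcorner \zeta;\eta \epsilon \hat{\alpha}\hat{\beta}\phi .\equiv. [1 .]\ (\exists\alpha)(\exists\beta)(\alpha, \beta \epsilon V . \zeta;\eta = \alpha;\beta . \phi) \urcorner$   (2)
       †419, *117 (& hp)   $\vdash \ulcorner \zeta, \eta \epsilon V .\supset (\alpha)(\beta)(\alpha, \beta \epsilon V . \zeta;\eta = \alpha;\beta .\equiv. \alpha = \zeta . \beta = \eta)$
       *121                $\supset: [2 \equiv.]\ L2 \equiv (\exists\alpha)(\exists\beta)(\alpha = \zeta . \beta = \eta . \phi) \urcorner$   (3)
       *100 (& D24)        $\vdash \ulcorner [3 \supset]\ 430 \urcorner$.

¹ This notation was adapted from Frege by Russell. My reference to elementhood,
in interpreting the notation, is of course a departure.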

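In practice the notation $\ulcorner \hat{\alpha}\hat{\beta}\phi \urcorner$ will appear only where α and β
are distinct variables. No such explicit restriction needs be imposed,
however, because the desired metatheorems governing
$\ulcorner \hat{\alpha}\hat{\beta}\phi \urcorner$ turn out not to be violated by the case $\ulcorner \hat{\alpha}\hat{\alpha}\phi \urcorner$. We find
that $\hat{x}\hat{x}(\ldots x \ldots)$ is the relation which each member of $\hat{x}(\ldots x \ldots)$
bears to itself.

*431. $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi\,(\zeta,\eta) \equiv. \zeta = \eta . \zeta \epsilon \hat{\alpha}\phi \urcorner$.

Proof. Let α′ and β be new and distinct, and let φ′ be formed
from φ by putting α′ for all free occurrences of α.

       *430          $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi\,(\alpha',\beta) \equiv. \alpha', \beta \epsilon V . (\exists\alpha)(\exists\alpha)(\alpha = \alpha' . \alpha = \beta . \phi)$
       *137, *123    $\equiv. \alpha', \beta \epsilon V . (\exists\alpha)(\alpha = \alpha' . \alpha = \beta . \phi)$
       *234b, *123   $\equiv. \alpha', \beta \epsilon V . \alpha' = \beta . \phi'$
       *228, *123    $\equiv. \alpha', \alpha' \epsilon V . \alpha' = \beta . \phi'$
       *100          $\equiv. \alpha' = \beta . \alpha' \epsilon V . \phi'$
       *235, *123    $\equiv. \alpha' = \beta . \alpha' \epsilon \hat{\alpha}\phi \urcorner$   (1)
       *233          $\vdash \ulcorner [(\alpha')(\beta)1 \supset]\ 431 \urcorner$.

Thence we have the following corollary.

*432. $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi\,(\zeta,\zeta) \equiv. \zeta \epsilon \hat{\alpha}\phi \urcorner$.

Proof. †182   $\vdash \ulcorner \zeta = \zeta \urcorner$   (1)
       *431   $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi\,(\zeta,\zeta) \equiv. [1 .]\ \zeta \epsilon \hat{\alpha}\phi \urcorner$.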

Since $\hat{x}\hat{y}(\ldots x \ldots y \ldots)$ is the relation of any element x to any
element y such that ... x ... y ... , any entity x will bear the relation
to any entity y if and only if x and y are elements and
... x ... y .... That the closures of such biconditionals are theorems
is established in the following metatheorem. It is the analogue,
for relations, of *230.

*433. $\vdash \ulcorner \hat{\alpha}\hat{\beta}\phi\,(\alpha,\beta) \equiv. \alpha, \beta \epsilon V . \phi \urcorner$.

Proof. Case 1: α is β.

       *432   $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi\,(\alpha,\alpha) \equiv. \alpha \epsilon \hat{\alpha}\phi$
       *230   $\equiv. \alpha \epsilon V . \phi$
       *100   $\equiv. \alpha, \alpha \epsilon V . \phi \urcorner$, q.e.d.

Case 2: α is not β. Let α′ and β′ be new and distinct, let φ′ be
formed from φ by putting α′ for all free occurrences of α, and let
φ″ be formed from φ′ by putting β′ for all free occurrences of β.

       *430          $\vdash \ulcorner \hat{\alpha}\hat{\beta}\phi\,(\alpha',\beta') \equiv. \alpha', \beta' \epsilon V . (\exists\alpha)(\exists\beta)(\alpha = \alpha' . \beta = \beta' . \phi)$
       *158, *123    $\equiv. \alpha', \beta' \epsilon V . (\exists\alpha)(\alpha = \alpha' . (\exists\beta)(\beta = \beta' . \phi))$
       *234b, *123   $\equiv. \alpha', \beta' \epsilon V . (\exists\beta)(\beta = \beta' . \phi')$
       *234b, *123   $\equiv. \alpha', \beta' \epsilon V . \phi'' \urcorner$   (1)
       *233          $\vdash \ulcorner [(\alpha')(\beta')1 \supset]\ 433 \urcorner$.

The principle embodied in *433 has the following as its more
general formulation. This is the analogue of *235.

*434. If ψ is like φ except for containing free occurrences of ζ and η
in place respectively of all free occurrences of α and β, then
$\vdash \ulcorner \hat{\alpha}\hat{\beta}\phi\,(\zeta,\eta) \equiv. \zeta, \eta \epsilon V . \psi \urcorner$.

Proof. *233 (& hp)   $\vdash \ulcorner [(\alpha)(\beta)433 \supset]\ 434 \urcorner$.

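The following is the analogue, for relations, of *314.

*435. $\vdash \ulcorner (\alpha)(\beta)(\phi \supset \psi) \supset. \hat{\alpha}\hat{\beta}\phi \subset \hat{\alpha}\hat{\beta}\psi \urcorner$.

Proof. Let γ be new.

       *100   $\vdash \ulcorner \phi \supset \psi .\supset: \alpha, \beta \epsilon V . \gamma = \alpha;\beta . \phi .\supset. \alpha, \beta \epsilon V . \gamma = \alpha;\beta . \psi \urcorner$   (1)
       *101   $\vdash \ulcorner [(\beta)1 \supset:]\ (\beta)(\phi \supset \psi) \supset (\beta)R1$
       *149   $\supset. (\exists\beta)LR1 \supset (\exists\beta)RR1 \urcorner$   (2)

Similarly, from (2),

       $\vdash \ulcorner (\alpha)(\beta)(\phi \supset \psi) \supset. (\exists\alpha)(\exists\beta)LR1 \supset (\exists\alpha)(\exists\beta)RR1 \urcorner$   (3)

       (3), *117      $\vdash \ulcorner (\alpha)(\beta)(\phi \supset \psi) \supset (\gamma)((\exists\alpha)(\exists\beta)LR1 \supset (\exists\alpha)(\exists\beta)RR1)$
       *314 (& D25)   $\supset. \hat{\alpha}\hat{\beta}\phi \subset \hat{\alpha}\hat{\beta}\psi \urcorner$.

The iterated use of ‘L’ and ‘R’, as above, is self-explanatory;
‘LR1’, e.g., stands as an abbreviation for the matter:

$\alpha, \beta \epsilon V . \gamma = \alpha;\beta . \phi$

which stands to the left of the main connective in the expression:

$\alpha, \beta \epsilon V . \gamma = \alpha;\beta . \phi .\supset. \alpha, \beta \epsilon V . \gamma = \alpha;\beta . \psi$

for which ‘R1’ stands. (Cf. § 24.)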

The analogue of †189, viz. ‘$(x)(x = \hat{y}\hat{z}(x(y,z)))$’, is readily seen to
be false; for the relation $\hat{y}\hat{z}(x(y,z))$ will have only pairs of elements
as members, whereas x itself may have some non-pairs as members
besides. The relation $\hat{y}\hat{z}(x(y,z))$ is rather the relational part of x:
the class of all those members of x which are pairs of elements.
It is the largest relation included in x, and it coincides with x
only in case x is itself a relation. It will be referred to briefly as $\dot{x}$.
When we are concerned with a complex expression in place of ‘x’
the superior dot may conveniently be placed at the beginning;
e.g., ‘${}^{\cdot}(x \cap y)$’. The following definition is accordingly adopted.

D26. $\ulcorner \dot{\zeta} \urcorner$ or $\ulcorner {}^{\cdot}\zeta \urcorner$ for $\ulcorner \hat{\alpha}\hat{\beta}(\zeta(\alpha,\beta)) \urcorner$.
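
The relational part can be pictured as a filter that keeps only the pairs. The following is a rough sketch under a modelling assumption that is not Quine's: a Python 2-tuple stands in for a pair of elements, and anything else counts as a non-pair. The names `is_pair` and `relational_part` are hypothetical.

```python
def is_pair(m):
    """Treat a 2-tuple as the pair of its components (assumption)."""
    return isinstance(m, tuple) and len(m) == 2

def relational_part(x):
    """Keep just those members of x that are pairs."""
    return {m for m in x if is_pair(m)}

x = {(1, 2), (2, 3), "a", 7}
dot_x = relational_part(x)

print(dot_x, dot_x <= x, relational_part(dot_x) == dot_x)
```

On this reading a class is a relation exactly when the filter leaves it unchanged, and applying the filter twice adds nothing.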

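The following theorem and metatheorem reveal positions from
which the dot is suppressible.

†436. $(z)(y)(x)\ \dot{x}(y,z) \equiv x(y,z)$

Proof. *433 (& D26)   $\dot{x}(y,z) \equiv. y, z \epsilon V . x(y,z)$
       *100 (& D24)   $\equiv x(y,z)$

*437. $\vdash \ulcorner {}^{\cdot}\hat{\alpha}\hat{\beta}\phi = \hat{\alpha}\hat{\beta}\phi \urcorner$.

Proof. Case 1: α is not β.

       *433, *188           $\vdash \ulcorner \hat{\alpha}\hat{\beta}\,L433 = \hat{\alpha}\hat{\beta}\,R433$
       *100, *188 (& D25)   $= \hat{\alpha}\hat{\beta}\phi \urcorner$   (1)
       *171 (& D26)         $\vdash \ulcorner [1 \equiv]\ 437 \urcorner$.

Case 2: α is β. Let γ and δ be new and distinct.

       *137, *188 (& D25)    $\vdash \ulcorner \hat{\alpha}\hat{\alpha}\phi = \hat{\delta}(\exists\alpha)(\alpha, \alpha \epsilon V . \delta = \alpha;\alpha . \phi)$
       *234b, *188 (& D25)   $= \hat{\alpha}\hat{\gamma}(\phi . \alpha = \gamma)$   (1)
       *437 (Case 1)         $= {}^{\cdot}\hat{\alpha}\hat{\gamma}(\phi . \alpha = \gamma)$
       (1), *227             $= {}^{\cdot}\hat{\alpha}\hat{\alpha}\phi \urcorner$.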

Use of †436 and *437 will be tacit; dots will simply be dropped,
without comment, from the positions $\ulcorner \dot{\zeta}(\eta,\theta) \urcorner$ and $\ulcorner {}^{\cdot}\hat{\alpha}\hat{\beta}\phi \urcorner$.
(Cf. proofs of †446, †447.) This practice is still followed when the
relevant context $\ulcorner \dot{\zeta}(\eta,\theta) \urcorner$ or $\ulcorner {}^{\cdot}\hat{\alpha}\hat{\beta}\phi \urcorner$ is concealed under definitional
abbreviations. E.g., the second dot in ‘$\ddot{x}$’ would be dropped
by tacit use of *437 on the ground that it has the position
‘${}^{\cdot}\hat{y}\hat{z}x(y,z)$’ when ‘$\dot{x}$’ is expanded by D26. (Cf. proof of †450; also
§ 38.)

The relational part $\dot{V}$ of V is the universal relation, the relation
which every element bears to every element. It is the class
of all pairs of elements.

†438. $\dot{V} = \hat{x}(\exists y)(\exists z)(y, z \epsilon V . x = y;z)$

Proof. †420, *188 (& D26)   $\dot{V} = \hat{y}\hat{z}(y, z \epsilon V)$
       *100, *188 (& D25)   $= R438$

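The following two theorems provide alternative renderings of $\dot{x}$.

†439. $(x)\ \dot{x} = \hat{y}\hat{z}(y;z \epsilon x)$   Proof: *100, *188 (& D24-26).

†440. $(x)\ \dot{x} = x \cap \dot{V}$

Proof. *228, *123 (& D25)   $[439 \equiv:]\ \dot{x} = \hat{w}(\exists y)(\exists z)(y, z \epsilon V . w = y;z . w \epsilon x)$
       *158, *188           $= \hat{w}(\exists y)(w \epsilon x . (\exists z)(y, z \epsilon V . w = y;z))$
       *158, *188           $= \hat{w}(w \epsilon x . (\exists y)(\exists z)(y, z \epsilon V . w = y;z))$
       *245                 $= \hat{w}(w \epsilon x . w \epsilon \hat{w}(\exists y)(\exists z)(y, z \epsilon V . w = y;z))$
       †438, *227 (& D18)   $= x \cap \dot{V}$

The relational part $\dot{\Lambda}$ of $\Lambda$ is simply $\Lambda$. The null class and the
null relation are the same.

†441. $\dot{\Lambda} = \Lambda$

Proof. †440   $\dot{\Lambda} = \Lambda \cap \dot{V}$
       †298   $= \Lambda$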

To say that x is a relation is to say that all members of x are
pairs of elements. Thus a convenient symbolic rendering of ‘x is a
relation’ is ‘$x \subset \dot{V}$’. Another is ‘$x = \dot{x}$’. The following theorem
establishes the equivalence of the two.

†442. $(x)\ x \subset \dot{V} .\equiv. x = \dot{x}$

Proof. †323         $x \subset \dot{V} .\equiv. x = x \cap \dot{V}$
       †440, *224   $\equiv. x = \dot{x}$

Since a relation is merely a class of pairs, the notion of class
inclusion applies to relations in obvious fashion. The relation of
father, e.g., is included in that of parent. The following four
theorems deal with inclusion of a relation $\dot{x}$.

†443. $(x)\ \dot{x} \subset x$

Proof. †316         $x \cap \dot{V} \subset x$   (1)
       †440, *224   $443\ [\equiv 1]$

†444. $(x)\ \dot{x} \subset \dot{V}$   Proof similar.

†445. $(y)(x)\ \dot{x} \subset y .\equiv. \dot{x} \subset \dot{y}$

Proof. †331         $\dot{x} \subset y\ [. 444] .\equiv. \dot{x} \subset y \cap \dot{V}$
       †440, *224   $\equiv. \dot{x} \subset \dot{y}$

†446. $(y)(x)\ \dot{x} \subset \dot{y} .\equiv (z)(w)(x(z,w) \supset y(z,w))$

Proof. †422, *117     $\dot{x} \subset y .\supset R446$   (1)
       *435 (& D26)   $R446 \supset. \dot{x} \subset \dot{y}$   (2)
       *100           $[1 . 2 . 445 .\supset]\ 446$

In the first line of the above proof, note the tacit transition from
‘$\dot{x}(z,w)$’ to ‘$x(z,w)$’. This illustrates the convention which was
adopted in connection with †436. The first line of the next proof
is similar.

†447. $(y)(x)\ \dot{x} = \dot{y} .\equiv (z)(w)(x(z,w) \equiv y(z,w))$

Proof. *223, *117     $\dot{x} = \dot{y} .\supset R447$   (1)
       *186 (& D26)   $R447 \supset. \dot{x} = \dot{y}$   (2)
       *100           $[1 . 2 .\supset]\ 447$

Since each relation x is identical with $\dot{x}$ (cf. †442), theorems
containing dotted variables may be read simply as theorems about
all relations. The dot on a variable serves, in effect, merely to
confine a law to relations. †444, e.g., may be read as saying that
every relation is included in $\dot{V}$; and †447 may be read as saying
that relations are identical if and only if they hold between just
the same things.

The notions of logical sum and product, like that of class inclusion,
apply to relations in obvious fashion. The sum of the
relations father of and mother of, e.g., is the relation parent of;
and the product of the relations fond of and parent of is the relation
fond parent of. The first and third of the following theorems show
that the sum of the relational parts of two classes is the relational
part of the sum and that the product of the relational parts is the
relational part of the product.

†448. $(y)(x)\ \dot{x} \cup \dot{y} = {}^{\cdot}(x \cup y)$

Proof. †288         $(\dot{V} \cap x) \cup (\dot{V} \cap y) = \dot{V} \cap (x \cup y)$   (1)
       †440, *224   $448\ [\equiv 1]$

†449. $(y)(x)\ \dot{x} \cap y = {}^{\cdot}(x \cap y)$

Proof similar, using †286.

†450. $(y)(x)\ \dot{x} \cap \dot{y} = {}^{\cdot}(x \cap y)$

Proof. †449         $\dot{x} \cap \dot{y} = {}^{\cdot}(\dot{x} \cap y)$
       †449, *227   $= {}^{\cdot}(x \cap y)$

In this last step, note the tacit transition from ‘${}^{\cdot\cdot}(x \cap y)$’
to ‘${}^{\cdot}(x \cap y)$’. This illustrates the convention explained earlier.

The last two of the above theorems show that the dot may be
applied to a product or to either or both factors indifferently. But
the analogue of †449 for the logical sum does not hold.
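
The contrast between product and sum shows up on a small example. This is a sketch only, again assuming that Python 2-tuples stand in for pairs of elements and that strings serve as non-pairs; the sample classes are arbitrary.

```python
def relational_part(x):
    """Keep just the 2-tuples of x (the stand-in for pairs of elements)."""
    return {m for m in x if isinstance(m, tuple) and len(m) == 2}

dot = relational_part
x = {(1, 2), (2, 3), "a"}
y = {(2, 3), "a", "b"}

sum_law = (dot(x) | dot(y)) == dot(x | y)       # cf. †448
product_law = (dot(x) & y) == dot(x & y)        # cf. †449
both_dotted = (dot(x) & dot(y)) == dot(x & y)   # cf. †450
sum_analogue = (dot(x) | y) == dot(x | y)       # analogue of †449 for sums

print(sum_law, product_law, both_dotted, sum_analogue)
```

The last comparison fails because the undotted summand y contributes its non-pairs to the sum, whereas a product with a relation can contain nothing but pairs.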

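The definition D25 of relational abstraction might equivalently
have been formulated in the fashion:

D25′. $\ulcorner \hat{\alpha}\hat{\beta}\phi \urcorner$ for $\ulcorner \hat{\gamma}(\exists\alpha)(\exists\beta)(\iota\gamma\,(\alpha,\beta) . \phi) \urcorner$,

in view of the theorem:

$(z)(y)(x)\ \iota z\,(x,y) \equiv. x, y \epsilon V . z = x;y$.

Proof. *235, *123 (& D22, 24)   $\iota z\,(x,y) \equiv. x, y \epsilon V\ [. 411] . x;y = z$

The variant D25′ recommends itself if one is seeking a model for
definitions of abstraction of polyadic relations. The following
definitions would serve for the polyadic cases (see end of § 36).

$\ulcorner \hat{\alpha}\hat{\beta}\hat{\gamma}\phi \urcorner$ for $\ulcorner \hat{\delta}(\exists\alpha)(\exists\beta)(\exists\gamma)(\iota\delta\,(\alpha, \beta, \gamma) . \phi) \urcorner$,
$\ulcorner \hat{\alpha}\hat{\beta}\hat{\gamma}\hat{\delta}\phi \urcorner$ for $\ulcorner \hat{\alpha}'(\exists\alpha)(\exists\beta)(\exists\gamma)(\exists\delta)(\iota\alpha'\,(\alpha, \beta, \gamma, \delta) . \phi) \urcorner$,

etc.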


§ 38. Converse, Image, Relative Product 

THE CONVERSE $\breve{x}$ of a relation x is the relation which y bears
to z just in case x(z,y). Where x is the relation parent of, $\breve{x}$ is
child of; where x is greater than, $\breve{x}$ is less than; where x is above, $\breve{x}$
is below. The pairs belonging to $\breve{x}$ are those of x with their order
reversed.

D27. $\ulcorner \breve{\zeta} \urcorner$ or $\ulcorner {}^{\smile}\zeta \urcorner$ for $\ulcorner \hat{\alpha}\hat{\beta}(\zeta(\beta,\alpha)) \urcorner$.

The breve ‘${}^{\smile}$’, like the superior dot of D26, is placed at the beginning
in complex cases; e.g., ‘${}^{\smile}(x \cap y)$’.

We thus have the following theorem.

†460. $(z)(y)(x)\ \breve{x}(y,z) \equiv x(z,y)$

Proof. *433 (& D27)   $\breve{x}(y,z) \equiv. y, z \epsilon V . x(z,y)$
       *100 (& D24)   $\equiv x(z,y)$

The converse of the converse of anything x is the relational part
of x; and thus the converse of the converse of a relation is the relation
itself.

†461. $(x)\ \breve{\breve{x}} = \dot{x}$   Proof: †460, *188 (& D27, 26).
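
A short sketch may make the behaviour of the converse concrete, in particular the point that converting twice yields the relational part rather than the original class. The usual assumption applies: Python 2-tuples stand in for pairs of elements. The function and variable names are hypothetical.

```python
def is_pair(m):
    """Treat a 2-tuple as the pair of its components (assumption)."""
    return isinstance(m, tuple) and len(m) == 2

def relational_part(x):
    """Keep just those members of x that are pairs."""
    return {m for m in x if is_pair(m)}

def converse(x):
    """The pairs of x with their order reversed; non-pairs are dropped."""
    return {(m[1], m[0]) for m in x if is_pair(m)}

parent_of = {("Abraham", "Isaac"), ("Isaac", "Jacob")}
child_of = converse(parent_of)
mixed = parent_of | {"not a pair"}

print(child_of, converse(converse(mixed)) == relational_part(mixed))
```

For the relation `parent_of` the double converse returns the relation itself; for `mixed`, which has a non-pair among its members, it returns only the pairs.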

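In view of the convention explained in connection with †436
and *437 (§ 37), the dot is regularly dropped from the positions
$\ulcorner \breve{\dot{\zeta}} \urcorner$ and $\ulcorner {}^{\cdot}\breve{\zeta} \urcorner$ alike; for these contexts are abbreviations respectively
of $\ulcorner \hat{\alpha}\hat{\beta}\,\dot{\zeta}(\beta,\alpha) \urcorner$ and $\ulcorner {}^{\cdot}\hat{\alpha}\hat{\beta}\,\zeta(\beta,\alpha) \urcorner$, and thus involve the dot in the
conventionally suppressible ways $\ulcorner \dot{\zeta}(\eta,\theta) \urcorner$ and $\ulcorner {}^{\cdot}\hat{\alpha}\hat{\beta}\phi \urcorner$. Instances
of this practice appear in the last lines of the next two proofs.

†462. $(y)(x)\ \dot{x} \subset \dot{y} .\equiv. \breve{x} \subset \breve{y}$

Proof. *119, *123   $[446 \equiv:]\ \dot{x} \subset \dot{y} .\equiv (w)(z)(x(z,w) \supset y(z,w))$
       †460, *123   $\equiv (w)(z)(\breve{x}(w,z) \supset \breve{y}(w,z))$
       †446         $\equiv. \breve{x} \subset \breve{y}$

The above theorem shows that inclusion on the part of relations
is equivalent to inclusion on the part of their converses. Three
corollaries follow.

†463. $(y)(x)\ \dot{x} \subset \breve{y} .\equiv. \breve{x} \subset y$

Proof. †462         $\dot{x} \subset \breve{y} .\equiv. \breve{x} \subset \breve{\breve{y}}$
       †461, *224   $\equiv. \breve{x} \subset \dot{y}$
       †445         $\equiv. \breve{x} \subset y$

†464. $(y)(x)\ x \subset y .\supset. \breve{x} \subset \breve{y}$

Proof. †311   $[443 .]\ x \subset y .\supset. \dot{x} \subset y$
       †462   $\supset. \breve{x} \subset \breve{y}$

†465. $(y)(x)\ x \subset \breve{y} .\supset. \breve{x} \subset y$   Proof similar, using †463.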

The image of a class y by a relation x, or more briefly the x-image
of y, symbolically x“y, is the class of all those elements which bear
x to one or more members of y. Where x is the parent relation
and y is the class of violinists, x“y is the class of parents of
violinists. The members of x“y are the first elements of all those
element-pairs which belong to x and have as their second elements
members of y.
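
For finite data the image can be computed directly from this description. The sketch below again assumes that a relation is a set of pairs (z, w) read "z bears x to w"; the function name `image` and the sample names are hypothetical.

```python
def image(rel, cls):
    """The rel-image of cls: first elements of those pairs of rel
    whose second elements are members of cls."""
    return {z for (z, w) in rel if w in cls}

# Hypothetical sample data: (p, c) is read "p is a parent of c".
parent_of = {("Ann", "Bob"), ("Ann", "Cy"), ("Bob", "Dee")}
violinists = {"Cy", "Dee"}

parents_of_violinists = image(parent_of, violinists)
print(sorted(parents_of_violinists))
```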

D28. $\ulcorner (\zeta“\eta) \urcorner$ for $\ulcorner \hat{\alpha}(\exists\beta)(\zeta(\alpha,\beta) \,.\, \beta \in \eta) \urcorner$.

We thus have the following theorems. 

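†466. $(z)(y)(x)\ \ z \in x“y \ .\equiv\ (\exists w)(x(z,w) \,.\, w \in y)$

Proof. *230 (& D28) $z \in x“y \ .\equiv.\ z \in V \,.\, (\exists w)(x(z,w) \,.\, w \in y)$
*158 $\equiv (\exists w)(z \in V \,.\, x(z,w) \,.\, w \in y)$

*100 (& D24), *123 $\equiv (\exists w)(x(z,w) \,.\, w \in y)$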

†467. $(z)(y)(x)(w)\ \ x(z,w) \,.\, w \in y \ .\supset.\ z \in x“y$
Proof. *135 $x(z,w) \,.\, w \in y \ .\supset\ (\exists w)(x(z,w) \,.\, w \in y)$

†466 $\supset.\ z \in x“y$

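†468. $(z)(y)(x)\ \ x“y \subset z \ .\equiv\ (w)(w')(x(w,w') \,.\, w' \in y \ .\supset.\ w \in z)$

Proof.

†466, *123 (& D21) $x“y \subset z \ .\equiv\ (w)((\exists w')(x(w,w') \,.\, w' \in y) \ \supset.\ w \in z)$
*161, *123 $\equiv$ R468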

†469. $(z)(y)(x)\ \ x \subset y \ .\supset.\ x“z \subset y“z$
Proof. (422 * C y .3. x(w,w') 3 y(w,w') (1) 

*100 [1 .] x C y . x(w,w') . w'e z .3. y(w,w’) . w’e z 

(467 D.eoe/'z (2) 

*100, *117 x C y .3 (»)(«/)([2 .] x(w,w') . w'ez .3. w e y“z) 

(468 3.*“zCy"z 

†470. $(z)(y)(x)\ \ y \subset z \ .\supset.\ x“y \subset x“z$ Proof similar.

De Morgan remarked that traditional logic was inadequate to
proving the statement ‘If horses are animals then heads of horses
are heads of animals.’ This statement is an instance of †470.

The next two theorems are distributive laws. They show e.g. 
that the parents of violinists comprise the fathers of violinists and 
the mothers of violinists, and that the wives of classicists comprise 
the wives of Latinists and the wives of Hellenists. 


†471. $(z)(y)(x)\ \ (x \cup y)“z = (x“z) \cup (y“z)$

Proof. †424, *188 (& D28)

$(x \cup y)“z = \hat{w}(\exists w')(x(w,w') \lor y(w,w') \,.\, w' \in z)$

*100, *188 $= \hat{w}(\exists w')(x(w,w') \,.\, w' \in z \ .\lor.\ y(w,w') \,.\, w' \in z)$

*141, *188 $= \hat{w}((\exists w')(x(w,w') \,.\, w' \in z) \lor (\exists w')(y(w,w') \,.\, w' \in z))$

†466, *188 (& D19) $= (x“z) \cup (y“z)$

†472. $(z)(y)(x)\ \ x“(y \cup z) = (x“y) \cup (x“z)$ Proof similar.

The practice of telescoping duplicate steps, adopted hitherto
(§ 32) in connection with *123 and *224, will be usual also in
connection with *188, *227, and *245. The last line of the proof of
†471, e.g., is a condensation of the following two lines:

†466, *188 $= \hat{w}(w \in x“z \ .\lor\ (\exists w')(y(w,w') \,.\, w' \in z))$

†466, *188 (& D19) $= (x“z) \cup (y“z)$

Distributive laws parallel to †471 and †472 do not hold for the
logical product. The statement:

$(z)(y)(x)\ \ (x \cap y)“z = (x“z) \cap (y“z)$

is seen to be false by taking x as the relation older than, y as brother
of, and z as the class of sailors; for $(x \cap y)“z$ then becomes the
class of elder brothers of sailors, whereas $(x“z) \cap (y“z)$ becomes
the class of all persons who are at once older than some sailors and
brothers of some sailors. Even the younger brother of a sailor will
belong to the latter class, so long as he happens to be older than
some other sailor. Again, the statement:

$(z)(y)(x)\ \ x“(y \cap z) = (x“y) \cap (x“z)$

is seen to be false by taking x as the relation of benefiting, y as the
class of cripples, and z as the class of musicians; for $x“(y \cap z)$ then
comprises just the benefactors of crippled musicians, whereas
$(x“y) \cap (x“z)$ takes in also any person who happens to have
benefited both an unmusical cripple and an uncrippled musician.
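
The first counterexample above can be replayed on a toy model. This is a sketch only: relations are assumed to be sets of pairs, and the three persons and their ages are invented for the purpose. Bo is the younger brother of the sailor Sam but is older than another sailor, Tim.

```python
def image(rel, cls):
    """Everything bearing rel to at least one member of cls."""
    return {z for (z, w) in rel if w in cls}

# Hypothetical data: Sam is older than Bo, and Bo is older than Tim.
older_than = {("Sam", "Bo"), ("Sam", "Tim"), ("Bo", "Tim")}
brother_of = {("Sam", "Bo"), ("Bo", "Sam")}
sailors = {"Sam", "Tim"}

# Distribution over the logical sum holds.
sum_lhs = image(older_than | brother_of, sailors)
sum_rhs = image(older_than, sailors) | image(brother_of, sailors)

# Distribution over the logical product fails: nobody here is an
# elder brother of a sailor, yet Bo is both older than some sailor
# and brother of some sailor.
prod_lhs = image(older_than & brother_of, sailors)
prod_rhs = image(older_than, sailors) & image(brother_of, sailors)

print(sum_lhs == sum_rhs, prod_lhs == prod_rhs)
```

In this model the product image is still included in the intersection of the images; only the converse inclusion fails.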
The next theorems explain images involving $\Lambda$ and V.

†473. $(x)\ \ \Lambda“x = \Lambda$

Proof. †192 $y;z \mathrel{\tilde{\in}} \Lambda$ (1)

*100 (& D24) $[1 \supset]\ {\sim}(\Lambda(y,z) \,.\, z \in x)$ (2)

*131 ${\sim}(\exists z)(\Lambda(y,z) \,.\, z \in x)\ [\equiv (z)2]$ (3)

*194 (& D28) $[(y)3 \supset]\ 473$

†474. $(x)\ \ x“\Lambda = \Lambda$ Proof similar.

†475. $(x)\ \ V“x = \hat{y}(x \neq \Lambda)$

Proof. †420, *188 (& D28) $V“x = \hat{y}(\exists z)(y, z \in V \,.\, z \in x)$

†191, *188 $= \hat{y}(\exists z)(y \in V \,.\, z \in x)$

*158, *188 $= \hat{y}(y \in V \,.\, (\exists z)(z \in x))$

*246 $= \hat{y}(\exists z)(z \in x)$

†192b, *188 $= \hat{y}(x \neq \Lambda)$

†476. $(x)\ \ x“V = \hat{y}(\exists z)\, x(y,z)$ Proof: *100, *188 (& D28, 24).

In view of †475 and *253-*254, V“x is $\Lambda$ or V according as x is
or is not $\Lambda$. On the other hand x“V is, in view of †476, the class
of all elements bearing x to something. This class is called the
domain of x. Where x is the parent relation, its domain x“V is
the class of all parents.

†477. $(y)(x)\ \ y \in x“V \ .\equiv\ (\exists z)\, x(y,z)$

Proof. †466 $y \in x“V \ .\equiv\ (\exists z)(x(y,z) \,.\, z \in V)$

*100 (& D24), *123 $\equiv (\exists z)\, x(y,z)$

The images of unit classes play an important role. The image
by x of $\iota z$ is simply the class of all elements bearing x to z.

†478. $(z)(x)\ \ x“\iota z = \hat{y}(x(y,z))$

Proof. †345, *188 (& D28) $x“\iota z = \hat{y}(\exists w)(x(y,w) \,.\, w \in V \,.\, w = z)$
*234b, *188 $= \hat{y}(x(y,z) \,.\, z \in V)$

*100 (& D24), *188 $= \hat{y}(x(y,z))$

†479. $(z)(y)(x)\ \ y \in x“\iota z \ .\equiv\ x(y,z)$

Proof. †478, *224 $y \in x“\iota z \ .\equiv.\ y \in \hat{y}(x(y,z))$

*230 $\equiv.\ y \in V \,.\, x(y,z)$

*100 (& D24) $\equiv x(y,z)$

The image of y by $\breve{x}$, i.e. $\breve{x}“y$, might be called the converse image
of y by x. It is the class of all elements to which members of y bear
x. Where x is parent and y violinists, $\breve{x}“y$ is the class of all children
of violinists.

†480. $(y)(x)\ \ \breve{x}“y = \hat{w}(\exists z)(x(z,w) \,.\, z \in y)$

Proof: †460, *188 (& D28).

†481. $(y)(x)(w)\ \ w \in \breve{x}“y \ .\equiv\ (\exists z)(x(z,w) \,.\, z \in y)$

Proof. †466 $w \in \breve{x}“y \ .\equiv\ (\exists z)(\breve{x}(w,z) \,.\, z \in y)$ (1)

†460, *123 $[1 \equiv]\ 481$

The domain $\breve{x}“V$ of $\breve{x}$ is called the converse domain of x. It is the
class of all elements to which x is borne. Where x is the relation
of hate, its converse domain $\breve{x}“V$ is the class of all who are hated.
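
Domain and converse domain are easily read off a finite relation. The sketch below assumes relations are sets of pairs (z, w) with z bearing the relation to w; the function names and the sample pairs are hypothetical and serve only as illustration.

```python
def domain(rel):
    """Class of all elements bearing rel to something."""
    return {z for (z, w) in rel}

def converse_domain(rel):
    """Class of all elements to which rel is borne."""
    return {w for (z, w) in rel}

def converse(rel):
    return {(w, z) for (z, w) in rel}

# Hypothetical data: (a, b) is read "a hates b".
hates = {("Iago", "Othello"), ("Iago", "Cassio"), ("Tybalt", "Romeo")}

print(sorted(domain(hates)))
print(sorted(converse_domain(hates)))
# The converse domain of a relation is the domain of its converse.
print(converse_domain(hates) == domain(converse(hates)))
```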

†482. $(x)\ \ \breve{x}“V = \hat{z}(\exists y)\, x(y,z)$ Proof similar, from †476.

†483. $(z)(x)\ \ z \in \breve{x}“V \ .\equiv\ (\exists y)\, x(y,z)$ Proof similar, from †477.

Similarly $\breve{x}“\iota y$ is the class of all elements to which y bears x.

†484. $(y)(x)\ \ \breve{x}“\iota y = \hat{z}(x(y,z))$ Proof similar, from †478.

†485. $(z)(y)(x)\ \ z \in \breve{x}“\iota y \ .\equiv\ x(y,z)$ Proof similar, from †479.

The relative product of a relation x into a relation x', symbolically
x | x', is the relation which y bears to w whenever y bears x to
something which bears x' to w. Where x is the brother relation and x'
the parent relation, x | x' is the relation of uncle. Where x is the
father relation and x' the mother relation, x | x' becomes the
relation of maternal grandfather and x' | x becomes the relation of
paternal grandmother.
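
The maternal-grandfather example can be checked mechanically. The following sketch assumes finite relations given as sets of pairs; `relative_product` and the family data are hypothetical names and values chosen for illustration.

```python
def relative_product(x, x2):
    """Pairs (y, w) such that y bears x to something bearing x2 to w."""
    return {(y, w) for (y, z) in x for (z2, w) in x2 if z == z2}

# Hypothetical data: (f, c) "f is father of c"; (m, c) "m is mother of c".
father_of = {("Gus", "Mia"), ("Tom", "Ned")}
mother_of = {("Mia", "Kid"), ("Eve", "Tom")}

maternal_grandfather_of = relative_product(father_of, mother_of)
paternal_grandmother_of = relative_product(mother_of, father_of)

print(maternal_grandfather_of)
print(paternal_grandmother_of)
```

The two products differ, which shows on this small scale that the order of the factors matters.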

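D29. $\ulcorner (\zeta \mid \eta) \urcorner$ for $\ulcorner \hat{\alpha}\hat{\gamma}(\exists\beta)(\zeta(\alpha,\beta) \,.\, \eta(\beta,\gamma)) \urcorner$.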

We thus have the following theorems. 

†486. $(x')(y)(x)(w)\ \ (x \mid x')(y,w) \equiv (\exists z)(x(y,z) \,.\, x'(z,w))$

Proof from *433 like that of †466 from *230.

†487. $(x')(z)(y)(x)(w)\ \ x(y,z) \,.\, x'(z,w) \ .\supset\ (x \mid x')(y,w)$

Proof from †486 like that of †467 from †466.

†488. $(z)(y)(x)\ \ x \mid y \subset z \ .\equiv\ (x')(y')(z')(x(x',y') \,.\, y(y',z') \ .\supset.\ z(x',z'))$

Proof from †446 and †486 like that of †468 from D21 and †466.

†489. $(z)(y)(x)\ \ x \subset y \ .\supset.\ x \mid z \subset y \mid z$

Proof from †487 and †488 like that of †469 from †467 and †468.

†490. $(z)(y)(x)\ \ y \subset z \ .\supset.\ x \mid y \subset x \mid z$ Proof similar.

Relative multiplication is not commutative. A maternal
grandfather, e.g., is not the same as a paternal grandmother. But relative
multiplication is indeed associative, as the following theorem shows.

†491. $(z)(y)(x)\ \ (x \mid y) \mid z = x \mid (y \mid z)$

Proof. †486, *188 (& D29)

$(x \mid y) \mid z = \hat{x}'\hat{w}'(\exists z')((\exists y')(x(x',y') \,.\, y(y',z')) \,.\, z(z',w'))$

*158, *188 $= \hat{x}'\hat{w}'(\exists z')(\exists y')(x(x',y') \,.\, y(y',z') \,.\, z(z',w'))$

*138, *188 $= \hat{x}'\hat{w}'(\exists y')(\exists z')(x(x',y') \,.\, y(y',z') \,.\, z(z',w'))$

*158, *188 $= \hat{x}'\hat{w}'(\exists y')(x(x',y') \,.\, (\exists z')(y(y',z') \,.\, z(z',w')))$

†486, *188 (& D29) $= x \mid (y \mid z)$

A null factor nullifies the product. 

†492. $(x)\ \ \Lambda \mid x = \Lambda$ Proof similar to that of †473.

†493. $(x)\ \ x \mid \Lambda = \Lambda$ Proof similar.

The converse of the relative product is the relative product of 
the converses in opposite order. 

†494. $(y)(x)\ \ {}^{\smile}(x \mid y) = \breve{y} \mid \breve{x}$

Proof. †486, *188 (& D27) ${}^{\smile}(x \mid y) = \hat{w}\hat{z}(\exists x')(y(x',w) \,.\, x(z,x'))$
†460, *188 (& D29) $= \breve{y} \mid \breve{x}$

E.g., since the converse of teacher of is pupil of and the converse 
of child of is parent of, the above theorem tells us that the converse 
of the relative product teacher of child of is the relative product 
parent of pupil of. 

The concluding theorem exhibits an important connection
between the image and relative product. It says e.g. that the uncles
of violinists are the brothers of the parents of violinists.

†495. $(z)(y)(x)\ \ (x \mid y)“z = x“(y“z)$

Proof like that of †491, using D28 instead of D29 and †466
instead of the last use of †486.
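
The associative law, the law for the converse of a relative product, and the law connecting image with relative product can all be spot-checked on finite data. What follows is merely a sketch on one hand-picked example, not a proof; the helper names and the numerical relations are hypothetical.

```python
def converse(rel):
    return {(w, z) for (z, w) in rel}

def image(rel, cls):
    return {z for (z, w) in rel if w in cls}

def relative_product(x, y):
    return {(a, c) for (a, b) in x for (b2, c) in y if b == b2}

# Hypothetical finite relations over small integers.
x = {(1, 2), (2, 3)}
y = {(2, 4), (3, 5)}
z = {(4, 6), (5, 7)}
cls = {4}

assoc_left = relative_product(relative_product(x, y), z)
assoc_right = relative_product(x, relative_product(y, z))

conv_left = converse(relative_product(x, y))
conv_right = relative_product(converse(y), converse(x))

img_left = image(relative_product(x, y), cls)
img_right = image(x, image(y, cls))

print(assoc_left == assoc_right, conv_left == conv_right, img_left == img_right)
```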

The notions of converse and relative product, along with the identity function I 
(cf. § 42), are familiar to students of group theory. In this connection they were 
clearly formulated and investigated by Cayley (1854). But in group theory the 
notions of converse and relative product occur only in application to a special sort 
of relations, viz. functions (in the sense of § 40) whose converses are functions. 
Converses and relative products of relations generally are dealt with by De Morgan
(1860), and so also is the notion of image. But De Morgan does not distinguish
explicitly between the relative product and the image; the selfsame notation
designates x | y or x“y according as y is or is not a relation. The same practice
recurs in Peirce; but in Frege (1879) the distinction becomes explicit. The present
notation is Russell’s.


§ 39. The Ancestral

A CLASS y is said to be closed with respect to a relation x if
whatever bears x to a member of y belongs in turn to y; i.e., if $x“y \subset y$.
The class of humans, e.g., is closed with respect to the nephew
relation; for all nephews of humans are human. The class of males
is likewise closed with respect to the nephew relation. The class of
even numbers, again, is closed with respect to the relation square
of; for the squares of even numbers are even. On the other hand
the class of males is not closed with respect to the parent relation;
for not all parents of males are male. Again, the class of even
numbers is not closed with respect to the relation half of; for the
halves of even numbers are not always even.
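
The two numerical examples can be tested on finite truncations of the classes and relations concerned. This is a sketch under that assumption: the bounds 400, 20, and 40 are arbitrary choices made so that the squares stay inside the truncated class, and the names are hypothetical.

```python
def image(rel, cls):
    return {z for (z, w) in rel if w in cls}

def is_closed(cls, rel):
    """True when the rel-image of cls is included in cls."""
    return image(rel, cls) <= cls

# Finite truncations, assumed only for illustration.
evens = set(range(0, 401, 2))
square_of = {(n * n, n) for n in range(21)}       # (square, root)
half_of = {(n // 2, n) for n in range(0, 41, 2)}  # (half, whole)

print(is_closed(evens, square_of))  # squares of even numbers are even
print(is_closed(evens, half_of))    # but 1 is half of 2, and 1 is odd
```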

An element may belong to an infinite variety of classes each of
which is closed with respect to a given relation x. Napoleon, e.g.,
belongs to an infinite variety of classes each of which is closed with
respect to the parent relation. One of these is the class of all
organisms, since parents of organisms are organisms. Even
broader classes of the kind can be formed, by supplementing the
class of organisms with any parentless sort of objects, say desks;
for no such trivial supplementation will alter the fact that all
parents of members are members. Moreover, there are classes
narrower than that of organisms which still contain Napoleon and
are closed with respect to the parent relation. One such is the
class of all organisms born before 1800; Napoleon belongs to this,
and so do the parents of all the members. Still narrower classes of
the kind can be got by casting out all childless persons, or all
persons of Chinese descent; for no parents of the residual members
will thereby be extruded, nor presumably will Napoleon himself.
Certain members, however, will be common to this whole infinite
variety of classes. One common member, of course, is Napoleon.
Two more are the parents of Napoleon; clearly they must belong
to any class which contains Napoleon and is closed with respect
to parent. Four others are the parents of Napoleon’s parents;
and so on. Napoleon and all his ancestors are thus distinguished
from the rest of the world by this formal characteristic: they
belong to every class which contains Napoleon and is closed with
respect to the parent relation. If we so construe ‘ancestor’ as to
reckon a person among his own ancestors, then ‘z is an ancestor of
Napoleon’ can be explained as meaning that z belongs to every
class which contains Napoleon and is closed with respect to parent;
symbolically,

$(y)(\text{parent}“y \subset y \,.\, \text{Napoleon} \in y \ .\supset.\ z \in y)$.

The relation of ancestor can be defined, in terms of that of parent,
as

$\hat{z}\hat{w}(y)(\text{parent}“y \subset y \,.\, w \in y \ .\supset.\ z \in y)$.

More generally, given any relation x instead of parent, the
relation

(1) $\hat{z}\hat{w}(y)(x“y \subset y \,.\, w \in y \ .\supset.\ z \in y)$

will be called the ancestral of x; symbolically, ${*}x$. Where x is
the parent relation, ${*}x$ is the genealogical relation of ancestor;
where x is the offspring relation, ${*}x$ is the relation of descendant.
In general, ${*}x$ is the relation of z to w such that z is w or else bears
x to w or else bears x to something which bears x to w or else etc.
The rigorous formulation of ${*}x$, viz. (1), is important in that it
constitutes a logical analysis of the essential notion involved in
the idiom ‘etc.’.
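
For a finite relation over a finite universe, formulation (1) can be computed without surveying every closed class. The sketch below rests on a labelled assumption: z belongs to every closed class containing w just in case z belongs to the least such class, and that least class can be built by repeatedly adjoining images until nothing new appears. The names `least_closed_class` and `ancestral`, and the small genealogy, are illustrative only.

```python
def image(rel, cls):
    return {z for (z, w) in rel if w in cls}

def least_closed_class(rel, start):
    """Smallest class that contains start and is closed with respect to rel."""
    cls = set(start)
    while True:
        new = image(rel, cls) - cls
        if not new:
            return cls
        cls |= new

def ancestral(rel, universe):
    """Pairs (z, w) such that z is in every closed class containing w.
    Assumes every element mentioned in rel belongs to universe."""
    return {(z, w) for w in universe for z in least_closed_class(rel, {w})}

# Illustrative data: (p, c) is read "p is a parent of c".
parent_of = {("Carlo", "Napoleon"), ("Letizia", "Napoleon"),
             ("Giuseppe", "Carlo")}
universe = {"Napoleon", "Carlo", "Letizia", "Giuseppe", "desk"}

ancestor_of = ancestral(parent_of, universe)
ancestors_of_napoleon = {z for (z, w) in ancestor_of if w == "Napoleon"}
print(sorted(ancestors_of_napoleon))
```

Every element of the universe, the parentless desk included, bears the computed relation to itself, in keeping with the inclusive sense of ‘ancestor’ adopted in the text.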

We thus adopt the following definition. 

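D30. $\ulcorner {*}\zeta \urcorner$ for $\ulcorner \hat{\alpha}\hat{\beta}(\gamma)(\zeta“\gamma \subset \gamma \,.\, \beta \in \gamma \ .\supset.\ \alpha \in \gamma) \urcorner$.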

The following theorem is forthcoming. 

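†510. $(z)(y)(x)(w)\ \ x“y \subset y \,.\, {*}x(z,w) \,.\, w \in y \ .\supset.\ z \in y$

Proof.

*110 $(y)(x“y \subset y \,.\, w \in y \ .\supset.\ z \in y) \ \supset:\ x“y \subset y \,.\, w \in y \ .\supset.\ z \in y$ (1)

*433 (& D30) ${*}x(z,w) \ \equiv.\ z, w \in V \,.\, \mathrm{L}1$ (2)

*100 $[1 \,.\, 2 \ .\supset]\ 510$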

Every element bears the ancestral of every relation to itself. 

†511. $(y)(x)\ \ {*}x(y,y) \ \equiv.\ y \in V$

Proof. *100 $x“z \subset z \,.\, y \in z \ .\supset.\ y \in z$ (1)

*434 (& D30) ${*}x(y,y) \ \equiv.\ y, y \in V\ [.\ (z)1]$

*100 $\equiv.\ y \in V$

Two corollaries follow. 

•f'512. 1 (y)(-v) yC**|y 

Proof. *100 (& D24) y(z ,w) 3, z e V . y(z ,a>) 
f511, *123 3. *x(z,z) . y(z,w) 

f487 D (*x\y)(z,w) (1) 

|446 512 [ = (z)(zt>)l3 

f513. ()')(.'. ) y C y | *x Proof similar. 

Every relation is included in its ancestral. 
t614. (x) x C *x 

Proof. *110 (& D21) x“y C y .3 : z e x“y .3. z e y ( 1 ) 

* 100 , *117 j c{z,w) 3(y)([l . 467 x“y C y . w e y .3. z e y) (2) 

*435 (& D26, 30) [(z)(k >)2 3] 514 

A class is closed with respect to x if and only if it is closed with
respect to ${*}x$.

†515. $(y)(x)\ \ x“y \subset y \ .\equiv.\ {*}x“y \subset y$

Proof.

*100, *117 $x“y \subset y \ .\supset\ (z)(w)([510 \equiv:]\ {*}x(z,w) \,.\, w \in y \ .\supset.\ z \in y)$

†468 $\supset.\ {*}x“y \subset y$ (1)

†469 $[514 \supset.]\ x“y \subset {*}x“y$ (2)

†311 $[2 \,.]\ {*}x“y \subset y \ .\supset.\ x“y \subset y$ (3)

*100 $[1 \,.\, 3 \ .\supset]\ 515$




The ancestral is its own ancestral, and its own relative
self-product.

†516. $(x)\ \ {*}x = {*}{*}x$ Proof: †515, *188 (& D30).

†517. $(x)\ \ {*}x = {*}x \mid {*}x$

Proof. †510 $x“y \subset y \,.\, {*}x(z',z) \,.\, z \in y \ .\supset.\ z' \in y$ (1)

*100, *117 ${*}x(z',z) \,.\, {*}x(z,w) \ .\supset\ (y)([1 \,.\, 510 \ .\supset:]\ x“y \subset y \,.\, w \in y \ .\supset.\ z' \in y)$ (2)

*161 $[(z)2 \equiv.]\ (\exists z)\mathrm{L}2 \supset \mathrm{R}2$ (3)

*435 (& D29, 30) $[(z')(w)3 \supset.]\ {*}x \mid {*}x \subset {*}x$ (4)

†512 ${*}x \subset {*}x \mid {*}x$ (5)

†312 $517\ [\equiv.\ 5 \,.\, 4]$

1 Note that the star in ‘*x | y’ applies to ‘x’, not to ‘x | y’. See second footnote
of § 35.

The next two theorems show e.g. that the parents of one’s
ancestors and the ancestors of one’s parents figure among one’s
ancestors.

†518. $(x)\ \ x \mid {*}x \subset {*}x$

Proof. †489 $[514 \supset.]\ x \mid {*}x \subset {*}x \mid {*}x$ (1)

†517, *224 $518\ [\equiv 1]$

†519. $(x)\ \ {*}x \mid x \subset {*}x$ Proof similar, using †490.

The class ${*}x“y$ comprises all the objects which bear ${*}x$ to
members of y. Thus, where x is the parent relation and y the class of
lawyers, ${*}x“y$ is the class of all ancestors of lawyers. The following
theorem says that ${*}x“y$ is closed with respect to x.

†520. $(y)(x)\ \ x“({*}x“y) \subset {*}x“y$

Proof. †469 $[518 \supset.]\ (x \mid {*}x)“y \subset {*}x“y$ (1)

†495, *224 $[1 \equiv]\ 520$

The following sort of inference is characteristic of the ancestral. 
From the premisses: 

(z)(w)(parent(z,w) . w was born before 1800 

.3. z was born before 1800) 

and: 

ancestor (Fiorecchio, Napoleon) 

and: 

Napoleon was born before 1800, 

e.g., we infer that Fiorecchio was born before 1800. In general,
from premisses of the forms:

$(z)(w)(x(z,w) \,.\, \ldots w \ldots \ .\supset.\ \ldots z \ldots)$,

${*}x(A, B)$,

$\ldots B \ldots$

we infer that . . . A . . . . The metatheorem governing such
inferences is as follows.


*521. If $\alpha$ and $\alpha'$ are distinct and not free in $\zeta$, and $\alpha'$ is not free in
$\phi$, and $\phi'$, $\psi$, and $\psi'$ are like $\phi$ except for containing free occurrences
respectively of $\alpha'$, $\eta$, and $\eta'$ wherever $\phi$ contains free occurrences of $\alpha$, then

$\vdash \ulcorner (\alpha')(\alpha)(\zeta(\alpha',\alpha) \,.\, \phi \ .\supset\ \phi') \,.\, {*}\zeta(\eta',\eta) \,.\, \psi \ .\supset\ \psi' \urcorner$.


Proof. fS10 \- r H“<$p C 

&P . *f(r[, Tj) . 17 e dp .3. 

»? 

e dp 1 . 

( 1 ) 

*100 (& D24) 







h r r(« 

', a) . 0 .2 

10 ': = 

: £(a', a) . a e V . 

0 ■ 

,3. a'e V .0 

n 

*235 (& hp3), * 

123 


I $(ol , a) . a e dp 

.3 

. a* e (X 0 " 1 . 

(. 2 ) 

Similarly 

h r'dp 

C ($0 . 

*$'(’?'» v) ■ t i' 

[: 

- l] n . 

(3) 

f468 (& hpl-2) 


C dtp . 

= (a') (a) R 2 n 




(2), *123 



■ («')(«)L 2 -\ 



(4) 


(4), *123 b r [3 =] 52T. 

Inferences of the above kind remain valid when the first of the
three premisses, viz.:

$(z)(w)(x(z,w) \,.\, \ldots w \ldots \ .\supset.\ \ldots z \ldots)$,

is weakened to read:

$(z)(w)(x(z,w) \,.\, {*}x(w,B) \,.\, \ldots w \ldots \ .\supset.\ \ldots z \ldots)$.

The metatheorem to this effect is as follows.


*522. If $\alpha$ and $\alpha'$ are distinct and free in neither $\zeta$ nor $\eta$, and [etc.
as in *521], then

$\vdash \ulcorner (\alpha')(\alpha)(\zeta(\alpha',\alpha) \,.\, {*}\zeta(\alpha,\eta) \,.\, \phi \ .\supset\ \phi') \,.\, {*}\zeta(\eta',\eta) \,.\, \psi \ .\supset\ \psi' \urcorner$.


Proof. 

t.518 Kf 

*r c 



(1) 

f5U 

»?) = 




(2) 

|422 

K[i 3-] (f 

1 ?) 

3 * 

lT(a', t))" 1 - 

(3) 

CO 

b r f(a', a) . 

(«, »») -3 

a 

r,)\ 

(4) 

*100 

K[4.3.3: 

.] L4.0 .3 

0 ' 

: L4 .0 .3. *f(c/, tj). 

■ 0 n -(5) 


(5), *123 \- r L522 3. («')(«)R5 . v) . 0 1 

*10.0 (&■ D24) 3: [2 3.] (a')(«)R5 . *fW, v) • v) ■ 0 n 

*521 (& hp) 1 D. 

*100 3 0 n . 

Typical use of *521 occurs in the proofs of the next two theorems. 

|523. ( x) = *x 

Proof. f487 *x(w,z) . x(z,y) .3 (*x | x)(zv,y) 

|422 [519 3.] (** | x){w,y) 3 *x(tc,y) 

1 In this step r *f(a, 17 ) . i ^" 1 plays the role of the <t> of *521. 


( 1 ) 

( 2 ) 
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*100 

[460.1 2 .~\x(y,z) . *x(w, z) .3 *x(w,y) 

(3) 

*521 

[(y)(z)3 .] *x(y,w) . *x(w,w) .3 *x(w,y) 

(4) 

f511 

*x(zv,w) =. W € V 

(5) 

*100 (& D24) 

[4.5.] *x(y,w) .3 *x(w,y) 


[460 

3 v *x(y,w) 

( 6 ) 

[446 

*x C ”** [. = (y)(w) 6 ] 

(7) 

f463 

[7 = ,] "*x C *x 

( 8 ) 

*231 

[(#)8 3.] “*# C *x 

(9) 

f461, *224 

[9 = .] v *x C *x 

( 10 ) 

[312 

523 [=. 10.7] 


|524. (x) 

*x | x =#■[** 


Proof. f513 

x C x | *x 

( 1 ) 

[422 

[1 3.] x(z', w) 3 (x | *#)(z', w ) 

( 2 ) 

[422 

[518 3.]] (at | *#)(z ,w) 3 *x{z,w) 

(3) 

f487 

x{y,z) . *x(z,w) .3 (x | *x){y,w) 

(4) 

*100 

[3.4.] x(y,z) . x{z’,w) 3 L3 .3. x(z',w) 3 R4 

(5) 

*521 

[(y)W 5 .] *x{y, z')[. 2 ] .3. x(z', w ) 3 R4 

( 6 ) 

*100 

[6 .] *x(y, z') . x(z', w) .3 R4 

(7) 

[488 

*x | x C x | *x [.= (y)(z')(w»)7] 

(8) 

f462 

[8 ^.] “(m# x)C “(.X | iex) 

(9) 

[494, *224 

[9 =.] x | '*x C ~*x | x 

( 10 ) 

"f523, *224 

[10 =.] x | C *x | x 

(11) 

*231 

[(a;) 11 3.] x | *x C *x | x 

( 12 ) 

[461, *224 

[12 =.] x | *x C *,* | x 

(13) 

|312 

524 [=i 8 . 13] 



Where x is the parent relation, we have seen that the ancestral
${*}x$ is the ancestor relation, provided that we so construe
‘ancestor’ as to count every person and indeed every element trivially
as an ancestor of itself. The relations $x \mid {*}x$ and ${*}x \mid x$, then, are
respectively the relations “parent of ancestor of” (in the above
sense of ‘ancestor’) and “ancestor of parent of”. †518 and †519
showed that these relations are included in the ancestor relation;
and the above theorem †524 shows further that they are the same.
This relation ${*}x \mid x$ or $x \mid {*}x$ is in fact simply the ancestor relation in
the narrower, more usual sense; it is the ancestor relation when we
construe the latter in such a way as not to reckon one among his
own ancestors. It is the relation which y bears to z when y is a
parent of z or a parent of a parent of z or . . . .

For any x, in general, the relation ${*}x \mid x$ (or $x \mid {*}x$) will be called
the proper ancestral of x. The ancestral of x is the relation of
an element y to an element z such that y is z or bears x to z or bears
x to something which bears x to z or . . . ; on the other hand the
proper ancestral ${*}x \mid x$ is rather the relation which y bears to z
just in case y bears x to z or bears x to something which bears x to
z or . . . .
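
The contrast between the ancestral and the proper ancestral shows up clearly on a small example. The following sketch assumes a finite relation over a finite universe and computes the ancestral by iterating images, as a stand-in for the definition by closed classes; all names and data are hypothetical.

```python
def image(rel, cls):
    return {z for (z, w) in rel if w in cls}

def relative_product(x, y):
    return {(a, c) for (a, b) in x for (b2, c) in y if b == b2}

def ancestral(rel, universe):
    """Ancestral of rel over a finite universe containing rel's elements."""
    pairs = set()
    for w in universe:
        cls = {w}
        while not image(rel, cls) <= cls:
            cls |= image(rel, cls)
        pairs |= {(z, w) for z in cls}
    return pairs

# Hypothetical chain: 1 bears rel to 2, and 2 bears rel to 3.
rel = {(1, 2), (2, 3)}
universe = {1, 2, 3, 4}

anc = ancestral(rel, universe)
proper_left = relative_product(anc, rel)
proper_right = relative_product(rel, anc)

print(sorted(anc))
print(sorted(proper_left))
```

On this acyclic example the proper ancestral drops exactly the self-pairs, and the two orders of multiplication agree.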

The line of reasoning used in D30 was first set forth by Frege in 1879
(Begriffsschrift, p. 60), in defining what I have called the proper ancestral. In 1884 he
introduced the ordinary ancestral, defining it in precisely the manner of D30 (cf.
Grundlagen, p. 94; also Grundgesetze, p. 60).

Whitehead and Russell’s notation for the ancestral is $\ulcorner \zeta_{*} \urcorner$, but I have
modified it to $\ulcorner {*}\zeta \urcorner$ because of a disinclination to accord parentheses to singulary
operators. Whitehead and Russell’s notation would have to be construed as involving
implicit parentheses in the fashion $\ulcorner (\zeta_{*}) \urcorner$ in order to deal with ambiguous
combinations such as ‘$\iota x_{*}$’.

Whitehead and Russell’s version of the ancestral diverges from the present one
(Frege’s) in this minor respect: y does not bear their $x_{*}$ to itself unless y bears x
to something or something bears x to y (cf. Principia, *90.01). The practical
advantage of this complication is not apparent. In any case the two versions are
interdefinable in obvious fashion:

$x_{*} = \hat{y}\hat{z}({*}x(y,z) \,.\, (\exists w)(x(y,w) \lor x(w,y)))$,

${*}x = \hat{y}\hat{z}(x_{*}(y,z) \ .\lor.\ y = z)$.

The proper ancestral is accorded a separate definition by Whitehead and Russell,
under the notation $\ulcorner \zeta_{\mathrm{po}} \urcorner$. (Cf. *910152).


§40. Functions 

INSTANCES of the notion of function are expressed by such
locutions as ‘quadruple of’, ‘double of’, ‘square of’; and the
application of a function is expressed by completing such locutions, in the
fashion ‘double of x’, ‘square of x’. The notation of functional
application is $\ulcorner \zeta‘\eta \urcorner$, or more strictly $\ulcorner (\zeta‘\eta) \urcorner$. Thus z‘3 is 12
where z is quadruple of; z‘3 is 9 where z is square of; and, to take
a non-numerical example, z‘V is $\iota$V where z is unit class of.

A suitable way of formally defining the notation $\ulcorner \zeta‘\eta \urcorner$ suggests
itself when we reflect that a function is simply a relation; quadruple
of, e.g., is the relation of the quadruple to the quarter. The result
z‘x of applying a function z to an element x is the element which
bears the relation z to x, symbolically $(\iota y)\, z(y,x)$. Thus

D31.¹ $\ulcorner (\zeta‘\eta) \urcorner$ for $\ulcorner (\iota\alpha)\, \zeta(\alpha,\eta) \urcorner$.

In the usual terminology, z‘x is the value of the function z for the
argument x; 12, e.g., is the value of quadruple for the argument 3.
More briefly, z‘x is spoken of as the z of x.

A function is, as observed, a relation. But it is a relation of a
special sort, having the peculiarity that no two elements bear it to
the same element.² No two numbers, e.g., are quadruples of the
same number. Common mathematical usage allows the term
‘function’ a broader scope, indeed, and countenances so-called
many-valued functions as well as single-valued ones; but we may
conveniently waive this usage, for the inclusive notion of function
which it involves is already at hand in the general notion of
relation. It is only where a unique element bears x to y, symbolically

(1) $(\exists w)(z)(z = w \ .\equiv\ x(z,y))$,

that we can speak non-trivially of the x of y; where (1) fails, x‘y
reduces trivially to $\Lambda$ (cf. *197, D31).
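
Application as just explained, including the trivial case, can be imitated for finite relations. In this sketch a relation is a set of pairs (value, argument), and the empty `frozenset` is merely a stand-in assumed here for the null class; the names `apply_function` and `NULL` are hypothetical.

```python
NULL = frozenset()  # stand-in for the null class

def apply_function(rel, arg):
    """Return the unique element bearing rel to arg, or NULL when
    no element or more than one element bears rel to arg."""
    values = {v for (v, a) in rel if a == arg}
    if len(values) == 1:
        return next(iter(values))
    return NULL

quadruple_of = {(4 * n, n) for n in range(10)}  # (quadruple, quarter)
square_of = {(n * n, n) for n in range(10)}     # (square, root)
parent_of = {("Ann", "Cy"), ("Bob", "Cy")}      # hypothetical data

print(apply_function(quadruple_of, 3))
print(apply_function(square_of, 3))
print(apply_function(parent_of, "Cy") == NULL)
```

The parent relation fails condition (1) for an argument with two parents, so the application collapses to the stand-in for the null class.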

Functionality of x, then, is conveniently explained as
single-valuedness of x; i.e., as truth of (1) for all elements y to which
the relation x is borne at all. But there are further relations x
which, though not functions in this sense, do fulfill (1) for certain
choices of y. Such relations may, by a natural extension, be spoken
of as functions in a relative sense; viz., as functions with respect
to this or that appropriate argument y. Functionality of x with
respect to y receives its symbolic expression in (1).

The class of those elements y with respect to which x is a
function, in this sense, will be called the range of functionality of x,
symbolically $\mathrm{r}x$. Thus $\mathrm{r}x$ is the class of all elements y for which (1)
holds.

D32. $\ulcorner \mathrm{r}\zeta \urcorner$ for $\ulcorner \hat{\beta}(\exists\gamma)(\alpha)(\alpha = \gamma \ .\equiv\ \zeta(\alpha,\beta)) \urcorner$.

1 The notation and the definition are both Russell’s (“Mathematical Logic,”
p. 253).

2 Here I follow Peano, “Sulla definizione di funzione.”

The range of functionality of the relation author of, e.g., is the
class of uncollaborated writings; the range of functionality of the
relation member of is the class of all unit classes of elements; and
the range of functionality of the relation included in has $\Lambda$ as sole
member.
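
The first two examples can be reproduced for finite data by counting, for each argument, how many elements bear the relation to it. This is a sketch assuming relations are sets of pairs (value, argument); the authors, titles, and small classes are invented placeholders.

```python
def range_of_functionality(rel):
    """Class of arguments to which exactly one element bears rel."""
    counts = {}
    for (_value, arg) in rel:
        counts[arg] = counts.get(arg, 0) + 1
    return {arg for arg, n in counts.items() if n == 1}

# Hypothetical data: (a, w) is read "a is an author of w".
author_of = {("Author A", "Solo Book"),
             ("Author B", "Joint Book"),
             ("Author C", "Joint Book")}

# Membership among a few small classes, modelled as frozensets.
classes = [frozenset({1}), frozenset({1, 2}), frozenset()]
member_of = {(m, c) for c in classes for m in c}

print(range_of_functionality(author_of))
print(range_of_functionality(member_of))
```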

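Ways of paraphrasing ‘$y \in \mathrm{r}x$’ are shown in the following two
theorems.

†530. $(y)(x)\ \ y \in \mathrm{r}x \ .\equiv\ (\exists w)(z)(z = w \ .\equiv\ x(z,y))$

Proof. †182 $w = w$ (1)

*103 $(z)(z = w \ .\equiv\ x(z,y)) \ \supset.\ [1 \equiv]\ x(w,y)$ (2)

*100 (& D24), *163 $(\exists w)([2 \,.]\ \mathrm{L}2) \ \supset.\ y \in V$ (3)

*230 (& D32) $y \in \mathrm{r}x \ .\equiv.\ y \in V \,.\, \mathrm{R}530$

*100 $\equiv.\ [3 \,.]\ \mathrm{R}530$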


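†531. $(y)(x)\ \ y \in \mathrm{r}x \ .\equiv\ (z)(z = x‘y \ .\equiv\ x(z,y))$

Proof. *196 (& D31) $(z)(z = w \ .\equiv\ x(z,y)) \ \supset.\ w = x‘y$ (1)

*223 $w = x‘y \ .\supset.\ \mathrm{L}1 \equiv \mathrm{R}531$ (2)

*100, *163 $(\exists w)(\mathrm{L}1\ [.\ 1 \,.\, 2]) \supset \mathrm{R}531$ (3)

*232 $\mathrm{R}531 \supset (\exists w)\mathrm{L}1$ (4)

*100 $[3 \,.\, 4 \,.\, 530 \ .\supset]\ 531$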


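If y is in the range of x, then x‘y bears x to y; and otherwise
x‘y is $\Lambda$.

†532. $(y)(x)\ \ y \in \mathrm{r}x \ .\supset\ x(x‘y, y)$

Proof. †182 $x‘y = x‘y$ (1)

†531 $y \in \mathrm{r}x \ .\supset\ \mathrm{R}531$

*231 $\supset.\ [1 \equiv]\ x(x‘y, y)$

†533. $(y)(x)\ \ y \mathrel{\tilde{\in}} \mathrm{r}x \ .\supset.\ x‘y = \Lambda$

Proof. *100 $[530 \supset.]\ y \mathrel{\tilde{\in}} \mathrm{r}x \ .\supset\ {\sim}\mathrm{R}530$

*197 (& D31) $\supset.\ x‘y = \Lambda$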



From †533 it follows that x‘y is $\Lambda$ unless y is an element.

†534. $(y)(x)\ \ y \mathrel{\tilde{\in}} V \ .\supset.\ x‘y = \Lambda$

Proof. †190 $y \in \mathrm{r}x \ .\supset.\ y \in V$ (1)

*100 $[1 \,.\, 533 \ .\supset]\ 534$

Thus y is an element if x‘y has a member.

†535. $(z)(y)(x)\ \ z \in x‘y \ .\supset.\ y \in V$

Proof. †192 $z \mathrel{\tilde{\in}} \Lambda$ (1)

*223 $x‘y = \Lambda \ .\supset:\ z \mathrel{\tilde{\in}} x‘y\ [.\equiv 1]$ (2)

*100 $[2 \,.\, 534 \ .\supset]\ 535$


But x‘y is an element regardless of x and y.

†536. $(y)(x)\ \ x‘y \in V$

Proof. *223 $x‘y = \Lambda \ .\supset.\ 536\ [\equiv 211]$ (1)

*100 (& D24) $[532 \,.\, 533 \,.\, 1 \ .\supset]\ 536$


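The paraphrases of ‘$y \in \mathrm{r}x$’ provided by †530 and †531 can now
be supplemented with a third.

†537. $(y)(x)\ \ y \in \mathrm{r}x \ .\equiv.\ \iota(x‘y) = x“\iota y$

Proof.

†345, *123 (& D10) $\mathrm{R}537 \equiv (z)([536 \,.]\ z = x‘y \ .\equiv.\ z \in x“\iota y)$

†479, *123 $\equiv \mathrm{R}531$ (1)

*100 $[1 \,.\, 531 \ .\supset]\ 537$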

The following theorem exhibits a certain analogy to f495. 

†538. $(z)(y)(x)\ \ z \in \mathrm{r}y \ .\supset.\ (x \mid y)‘z = x‘(y‘z)$

Proof. †537 $z \in \mathrm{r}y \ .\supset.\ y“\iota z = \iota(y‘z)$

*223 $\supset:\ w \in x“(y“\iota z) \ .\equiv.\ w \in x“\iota(y‘z)$

†495, *224 $\supset:\ w \in (x \mid y)“\iota z \ .\equiv.\ w \in x“\iota(y‘z)$

†479, *123 $\supset.\ (x \mid y)(w,z) \equiv x(w, y‘z)$ (1)

*159 (& D4) $[(w)1 \equiv:]\ z \in \mathrm{r}y \ .\supset\ (w)\mathrm{R}1$
*186 (& D31) $\supset \mathrm{R}538$

If x is the father relation, y is the son relation, and z is Eve,
then $(x \mid y)‘z$ is Adam whereas $x‘(y‘z) = x‘\Lambda = \Lambda$. Hence the need
of the antecedent ‘$z \in \mathrm{r}y$’ in the above theorem.

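The next concerns the ranges of functionality of relative
products.

†539. $(y)(x)\ \ \mathrm{r}x = V \ .\supset.\ \mathrm{r}y \subset \mathrm{r}(x \mid y)$

Proof. †536 $y‘z \in V$ (1)

*223 $\mathrm{r}x = V \ .\supset:\ y‘z \in \mathrm{r}x\ [.\equiv 1]$

†537 $\supset.\ \iota(x‘(y‘z)) = x“\iota(y‘z)$ (2)

†537 $z \in \mathrm{r}y \ .\supset.\ \iota(y‘z) = y“\iota z$

*223 $\supset\ [2 \equiv:]\ \mathrm{r}x = V \ .\supset.\ \iota(x‘(y‘z)) = x“(y“\iota z)$

†495, *224 $\supset:\ \mathrm{r}x = V \ .\supset.\ \iota(x‘(y‘z)) = (x \mid y)“\iota z$ (3)

*223 $\mathrm{R}538 \supset::\ z \in \mathrm{r}y \ .\supset:\ \mathrm{r}x = V \ .\supset.\ \iota((x \mid y)‘z) = (x \mid y)“\iota z\ [:\equiv 3]$

†537, *123 $\supset:\ z \in \mathrm{r}y \ .\supset:\ \mathrm{r}x = V \ .\supset.\ z \in \mathrm{r}(x \mid y)$ (4)

*100, *117 $\mathrm{r}x = V \ .\supset\ (z)(z \in \mathrm{r}y\ [.\ 4 \,.\, 538] \ .\supset.\ z \in \mathrm{r}(x \mid y))$

(q.e.d.; cf. D21)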

Besides functions of the sort here considered, which apply to
arguments one at a time, there are those which apply to arguments
two or more at a time. An example which applies to arguments
two at a time is the power function, whose value for arguments
x and y is $x^y$. A non-numerical example is logical sum, whose value
for arguments x and y is $x \cup y$. Just as a function of one argument
is a dyadic relation, so a function of two arguments is a triadic
relation; the power function, e.g., is the triadic relation which
holds among any numbers z, x, and y (in that order) when $z = x^y$.
A function of three arguments, similarly, is a tetradic relation;
and so on. In terms of the notions introduced at the end of § 36,
the definition D31 of application is readily extended to functions
of two or more arguments.

$\ulcorner (\zeta‘(\eta, \theta)) \urcorner$ for $\ulcorner (\iota\alpha)\, \zeta(\alpha, \eta, \theta) \urcorner$,
$\ulcorner (\zeta‘(\eta, \theta, \theta')) \urcorner$ for $\ulcorner (\iota\alpha)\, \zeta(\alpha, \eta, \theta, \theta') \urcorner$,

etc. The notion of range admits of extension in similar fashion,
in terms of the notions introduced at the end of § 37. A range of
n-argument functionality, in general, is itself an n-adic relation.

$\ulcorner \mathrm{r}_2\zeta \urcorner$ for $\ulcorner \hat{\beta}\hat{\gamma}(\exists\delta)(\alpha)(\alpha = \delta \ .\equiv\ \zeta(\alpha, \beta, \gamma)) \urcorner$,
$\ulcorner \mathrm{r}_3\zeta \urcorner$ for $\ulcorner \hat{\beta}\hat{\gamma}\hat{\delta}(\exists\alpha')(\alpha)(\alpha = \alpha' \ .\equiv\ \zeta(\alpha, \beta, \gamma, \delta)) \urcorner$,

etc.


§ 41. Abstraction of Functions

THE FUNCTION quadruple of may be thought of as abstracted
from the name matrix ‘4 × x’ in somewhat the same way in which
the class $\hat{x}(4 < x)$ is abstracted from the statement matrix ‘4 < x’.
The class $\hat{x}(4 < x)$ has, we recall, this characteristic feature: an
element x is a member of the class if and only if 4 < x. The
parallel feature of the function quadruple of is this: application of
the function to an element x yields the element 4 × x.

The notation of functional abstraction is $\ulcorner \lambda_\alpha \zeta \urcorner$. Thus quadruple
of is $\lambda_x(4 \times x)$; double of is $\lambda_x(2 \times x)$; square of is $\lambda_x(x^2)$; unit class
of is $\lambda_x \iota x$. The appropriate formal definition of $\ulcorner \lambda_\alpha \zeta \urcorner$ suggests
itself when we reflect, e.g., that quadruple of is simply the relation
of quadruple to quarter, and square of is the relation of square to
root; i.e.,

$\lambda_x(4 \times x) = \hat{y}\hat{x}(y = 4 \times x)$,

$\lambda_x(x^2) = \hat{y}\hat{x}(y = x^2)$.

Thus

D33. $\ulcorner \lambda_\alpha \zeta \urcorner$ for $\ulcorner \hat{\beta}\hat{\alpha}(\beta = \zeta) \urcorner$.

In view of the general convention governing the choice of odd
variables in definitions (cf. § 24), the $\beta$ of D33 must be distinct
from $\alpha$ and the variables of $\zeta$; on the other hand $\alpha$ may, and in
non-trivial cases will, occur in $\zeta$.

Mathematicians tend in practice to confuse a name matrix $\zeta$ with
the name $\ulcorner \lambda_\alpha \zeta \urcorner$ of the corresponding function. Such expressions as
‘4 × x’, ‘x²’, ‘x² + 2x − 1’, etc., though commonly treated as if
they designated functions, actually designate nothing; they are
not names at all, because of the free variable (cf. § 27). When
their free variables are supplanted by numerals, moreover, these
expressions come to designate not functions but numbers. The
functions which ‘4 × x’, ‘x²’, and ‘x² + 2x − 1’ suggest are
properly designated rather by the functional abstracts ‘$\lambda_x(4 \times x)$’,
‘$\lambda_x(x^2)$’, and ‘$\lambda_x(x^2 + 2x - 1)$’; these expressions are
straightforward names, devoid of free variables.

Note further that the distinction in meaning between $\zeta$ and
$\ulcorner \lambda_\alpha \zeta \urcorner$ survives even in the case where $\zeta$ itself is a name, and thus
devoid of free occurrences of $\alpha$. It is in this case that the so-called
“constant functions” appear; $\lambda_x 5$, e.g., is the function whose
value is uniformly 5 for all arguments. The names ‘5’ and
‘$\lambda_x 5$’ are altogether different in meaning; whereas 5 is a number,
the “constant function” $\lambda_x 5$ is rather the relation which 5 bears to
all elements (cf. *540).

Abstraction, defined in D33 for functions of one argument, is
extensible to the case of many arguments in obvious fashion:

$\ulcorner \lambda_{\alpha\beta} \zeta \urcorner$ for $\ulcorner \hat{\gamma}\hat{\alpha}\hat{\beta}(\gamma = \zeta) \urcorner$,
$\ulcorner \lambda_{\alpha\beta\gamma} \zeta \urcorner$ for $\ulcorner \hat{\delta}\hat{\alpha}\hat{\beta}\hat{\gamma}(\delta = \zeta) \urcorner$,

etc. (cf. end of § 40). $\lambda_{xy}(x^y)$, e.g., is the power function; and
$\lambda_{xy}(x^y)‘(5,3)$ is 125. In what follows, however, attention will be
limited to the one-argument case.
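
Functional abstraction can be imitated over a finite stock of arguments by pairing each value with its argument. The sketch below is offered under that assumption only: `abstract`, `apply_function`, the finite universe `range(10)`, and the truncated power table are all hypothetical, and Python's own `lambda` merely supplies the term to be abstracted.

```python
def abstract(term, universe):
    """Relation of each value term(a) to its argument a."""
    return {(term(a), a) for a in universe}

def apply_function(rel, arg, null=frozenset()):
    """Unique element bearing rel to arg, else a stand-in for the null class."""
    values = {v for (v, a) in rel if a == arg}
    return next(iter(values)) if len(values) == 1 else null

universe = range(10)
quadruple_of = abstract(lambda x: 4 * x, universe)
constant_five = abstract(lambda x: 5, universe)

# Two-argument case: triples (value, x, y) for a small table of powers.
power = {(x ** y, x, y) for x in range(1, 7) for y in range(1, 5)}
five_cubed = {v for (v, x, y) in power if (x, y) == (5, 3)}

print(apply_function(quadruple_of, 3))
print(apply_function(constant_five, 7))
print(five_cubed)
```

Note that `constant_five` is a set of ten pairs, each with 5 as first member, whereas the number 5 is no relation at all; this is the difference in meaning remarked on above.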

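The following metatheorem deals with the functional abstract
$\ulcorner \lambda_\alpha \zeta \urcorner$ in the context $\ulcorner \lambda_\alpha \zeta(\eta,\alpha) \urcorner$ of relational predication (D24).
Note that the occurrences of $\alpha$ in $\ulcorner \lambda_\alpha \zeta \urcorner$ are bound, whereas the
occurrence of $\alpha$ which terminates $\ulcorner \lambda_\alpha \zeta(\eta,\alpha) \urcorner$ is free.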

*640. J- r X a f (rt,a) =. a, f t V . rj « 

Proof. *434 (& D33) |- r X«f ( 17 ,«) =. a, v t V . r, = r n 

*228 s.«,feV.i 

As explained, X*(4 X x) is supposed to be the function whose 
value, for any element x as argument, is 4 X x. In general, 
X*(. . . x . . .) is supposed to be the function whose value, for any 
element x as argument, is . .. x ... (granted elementhood in turn 
of ... x . . .). This is provided by the following metatheorem. 

•641.* h r «>feV.D.X a f‘a = rT 

Proof. Let /3 be new. 

* 100 , *117 h r a,f «V.D (/3)(/3 - f a,f «V./S - f ) 1 

*540, *123 D 03)0 - f X«f (fra ))" 1 

*196 (& D31) D.Xaf'a - f 1 . 

Just as the metatheorems *230 and *433 on the abstraction of
classes and relations were generalized in *235 and *434, the
metatheorem *541 on functional abstraction is generalized in the
following.

*542. If θ is like ζ except for containing free occurrences of η in
place of all free occurrences of α, then

    ⊢ ⌜η, θ ε V .⊃. λ_α ζ‘η = θ⌝.

Proof. *231 (& hp)     ⊢ ⌜[(α)541 ⊃] 542⌝.

¹ So far as parentheses are concerned, the prefix ⌜λ_α⌝ as a whole is treated as if it
were a singulary operator; thus it is that no parentheses are entered in the left side
of D33. Accordingly the ⌜λ_α ζ‘α⌝ of *541 is to be read as composed of ⌜λ_α ζ⌝ and
α, not of ⌜λ_α⌝ and ⌜ζ‘α⌝; cf. second footnote to § 35. The functional abstract of
⌜ζ‘α⌝ with respect to α would be rendered rather ⌜λ_α(ζ‘α)⌝. The latter can, as a
matter of fact, be shown to obey a law somewhat like *541; namely, if α is not
free in ζ then ⊢ ⌜rζ = V .⊃. λ_α(ζ‘α) = ζ⌝.

The next metatheorem explains the notion of range as applied to
a functional abstract.

*543. ⊢ ⌜rλ_α ζ = α̂(ζ ε V)⌝.

Proof. Let β be new.

       †189            ⊢ ⌜rλ_α ζ = α̂(α ε rλ_α ζ)
       †531, *188        = α̂(β)(λ_α ζ‘α = β .≡ λ_α ζ(β,α))
       *540, *188        = α̂(β)(λ_α ζ‘α = β .≡. α, ζ ε V . β = ζ)
       *100, *188        = α̂(β)(λ_α ζ‘α = β .⊃. α, ζ ε V . β = ζ :
                               β = ζ .⊃: α, ζ ε V .⊃. λ_α ζ‘α = β)
       *140, *188        = α̂((β)(λ_α ζ‘α = β .⊃. α, ζ ε V . β = ζ) .
                               (β)(β = ζ .⊃: α, ζ ε V .⊃. λ_α ζ‘α = β))
       *234a, *188       = α̂(α, ζ ε V . λ_α ζ‘α = ζ [. 541])
       *100, *188        = α̂(α, ζ ε V [. 541])
       *246              = α̂(ζ ε V)⌝.

The next explains ⌜λ_α ζ‘η⌝ in the cases not covered by *542.

*544. If θ is like ζ except for containing free occurrences of η in
place of all free occurrences of α, then

    ⊢ ⌜~(η, θ ε V) ⊃. λ_α ζ‘η = Λ⌝.

Proof.
       *235 (& hp), *123   ⊢ ⌜~(η, θ ε V) ⊃. η ε̃ α̂(ζ ε V)
       *543, *224                         ⊃. η ε̃ rλ_α ζ
       †533                               ⊃. λ_α ζ‘η = Λ⌝.

Note that the antecedent in *544 is not ⌜η ε̃ V . θ ε̃ V⌝, but the
much weaker condition ⌜~(η ε V . θ ε V)⌝, i.e. ⌜η ε̃ V .v. θ ε̃ V⌝.
Thus *544 goes farther than †534.
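The behaviour described by *541 through *544 can be imitated in a small finite model. The sketch below is an assumption-laden illustration, not the formal system: a hypothetical finite collection `ELEMENTS` stands in for V, a functional abstract is modelled as a set of (value, argument) pairs restricted to elements, and application returns the unique value or else a stand-in for Λ. The helper names `abstract` and `value_at` are invented for the example.

```python
LAMBDA = frozenset()  # stand-in for the null class

def abstract(term, elements):
    """Finite stand-in for a functional abstract: pairs (value, argument),
    kept only where both argument and value are elements."""
    pairs = set()
    for argument in elements:
        value = term(argument)
        if value in elements:
            pairs.add((value, argument))
    return pairs

def value_at(relation, argument):
    """Stand-in for application: the unique value borne to the argument,
    or the null class when there is no unique one."""
    values = {value for (value, arg) in relation if arg == argument}
    return next(iter(values)) if len(values) == 1 else LAMBDA

ELEMENTS = range(20)                      # hypothetical tiny universe of elements
quadruple = abstract(lambda x: 4 * x, ELEMENTS)

in_range = value_at(quadruple, 3)         # argument and value both elements
value_not_element = value_at(quadruple, 10)   # 40 lies outside ELEMENTS
argument_not_element = value_at(quadruple, 99)
```

The first case corresponds to *542, where argument and value are both elements; the other two correspond to *544, where the failure of either elementhood condition yields the null class.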

Statements of the form ⌜λ_α ζ(λ_α ζ‘η, η)⌝ are not all true. If η
designates a non-element, or if the result of putting η for α in ζ
designates a non-element, then ⌜λ_α ζ(λ_α ζ‘η, η)⌝ will be false; nor
will there be any term θ for which ⌜λ_α ζ(θ, η)⌝ is true. If on the
other hand there is such a term θ at all, we can infer ⌜λ_α ζ‘η = θ⌝;
this is shown in the following metatheorem.

*545. ⊢ ⌜λ_α ζ(θ,η) ⊃. λ_α ζ‘η = θ⌝.

Proof. Let β be new.

       *223            ⊢ ⌜β = ζ .⊃:. α, ζ ε V .⊃. λ_α ζ‘α = β [:≡ 541]⌝.   (1)
       *540            ⊢ ⌜λ_α ζ(β,α) ⊃. α, ζ ε V . β = ζ
       *100                          ⊃: [1 ⊃.] λ_α ζ‘α = β⌝.               (2)
       *233            ⊢ ⌜[(α)(β)2 ⊃] 545⌝.



The value of the unit-class function λ_x ιx for an argument y
proves to be ιy regardless of whether y is an element.

†546. (y) λ_x ιx‘y = ιy

Proof. †359            ιy ε V                                   (1)
       *542            y ε V [. 1] .⊃ 546                       (2)
       †534            y ε̃ V .⊃. λ_x ιx‘y = Λ                   (3)
       †343            y ε̃ V .⊃. ιy = Λ
       *223                  ⊃:. y ε̃ V .⊃ 546 [:≡ 3]            (4)
       *100            [2 . 4 .⊃] 546


The notion of functional abstraction goes back to Frege (Grundgesetze, vol. 1, pp.
14f, 54f). The present notation is Church’s. Both Frege and Church construe 
functional abstraction more broadly, however, to include class abstraction as a 
special case. Correspondingly relational abstraction becomes, for those authors, a 
case of the abstraction of functions of many arguments. Special features of the 
groundwork of Frege’s logic and Church’s make this assimilation a natural one; 
and it would be equally natural in the logic based on inclusion and abstraction 
(cf. § 23). 


§42. Identity and Membership as Relations 

THE IDENTITY function I is defined as follows.¹

D34. ‘I’ for ‘λ_x x’.

It is the function whose value, for any element as argument, is that
element itself.

†550. (x) x ε V .≡. I‘x = x

Proof. *541 (& D34)    x, x ε V .⊃. I‘x = x                     (1)
       †536            I‘x ε V                                  (2)
       *223            I‘x = x .⊃. x ε V [≡ 2]                  (3)
       *100            [1 . 3 .⊃] 550

¹ See end of § 38.

Its range embraces all elements.

†551. rI = V

Proof. *543 (& D34)    rI = x̂(x ε V)
       †189               = V

Like all functions, I is a relation. In view of D34 and D33,
indeed, it is the identity relation x̂ŷ(x = y). Note, however, that
‘I(x,y)’ is not equivalent to ‘x = y’. Since only elements can enter
into a relation, ‘I(x,y)’ holds only where x and y are the same
element; on the other hand ‘x = y’ holds wherever x and y are the
same entity, whether an element or a non-element.


†552. (y)(x) I(x,y) .≡. x ε V . x = y

Proof. *434 (& D34, 33)    I(x,y) .≡. x, y ε V . x = y
       *228                       ≡. x, x ε V . x = y
       *100                       ≡. x ε V . x = y

†553. (y)(x) I(x,y) .≡. y ε V . x = y

Proof. *228, *123      [552 ≡] 553

†554. (y)(x) I(x,y) .≡. x ε ιy

Proof. †345, *123      554 [≡ 553]

†555. (x) I(x,x) .≡. x ε V

Proof. †552            I(x,x) .≡. x ε V [. 182]

To say that x includes I is to say that every element bears x to
itself.

†556. (x) I ⊂ x .≡ (y)(y ε V .⊃ x(y,y))

Proof. †446            I ⊂ x .≡ (y)(z)(I(y,z) ⊃ x(y,z))
       †552, *123            ≡ (y)(z)(y ε V . y = z .⊃ x(y,z))
       *100, *123            ≡ (y)(z)(y = z .⊃: y ε V .⊃ x(y,z))
       *234a, *123           ≡ R556

I is its own converse.

†557. I = Ĭ

Proof. *437                    I = İ
       †552, *188 (& D26)        = ŷẑ(y ε V . y = z)
       †553, *188 (& D27)        = Ĭ

Imaging by I has no effect.

†558. (x) I“x = x

Proof. †553, *188 (& D28)    I“x = ŷ(∃z)(y = z . z ε V . z ε x)
       *234b, *188 (& D18)       = V ∩ x
       †296                      = x

Relative multiplication by I leaves a relation unchanged.

†559. (x) I | x = ẋ

Proof. †553, *188 (& D29)    I | x = ŷŵ(∃z)(y = z . z ε V . x(z,w))
       *100 (& D24), *188          = ŷŵ(∃z)(y = z . x(z,w))
       *234b, *188 (& D26)         = ẋ

†560. (x) x | I = ẋ          Proof similar.

I is included in every ancestral.

†561. (x) I ⊂ *x

Proof. †511            y ε V .⊃ *x(y,y)                         (1)
       †556            561 [≡ (y)1]

†562. (x) I ∪ ẋ ⊂ *x

Proof. †332            562 [≡. 561 . 514]

The next theorem shows that I added to the proper ancestral
yields the ancestral.

†563. (x) I ∪ (x | *x) = *x

Proof. †332            L563 ⊂ *x [.≡. 561 . 518]                        (1)
       †422            [1 ⊃.] L563(z,w) ⊃ *x(z,w)                       (2)
       †487            x(y,z) . *x(z,w) .⊃ (x | *x)(y,w)                (3)
       *100            [2 . 3 .] x(y,z) . L563(z,w) .⊃. I(y,w) v (x | *x)(y,w)
       †424                      ⊃ L563(y,w)                            (4)
       *521            [(y)(z)4 .] *x(y,w) . L563(w,w) .⊃ L563(y,w)     (5)
       †424            L563(w,w) .≡. I(w,w) v (x | *x)(w,w)
       †555, *123                ≡: w ε V .v (x | *x)(w,w)              (6)
       *100 (& D24)    [5 . 6 .] *x(y,w) .⊃ L563(y,w)                   (7)
       †446            *x ⊂ L563 [.≡ (y)(w)7]                           (8)
       †312            563 [≡. 1 . 8]

The terms ‘transitive’ and ‘reflexive’, applied hitherto to modes of statement
composition (§ 11) and predicates (§ 25), may by extension be applied also to
relations in obvious fashion; historically this is, indeed, their original
application. A relation w is transitive if

    (x)(y)(z) w(x, y) . w(y, z) .⊃ w(x, z);

or, equivalently (by †488), if w | w ⊂ w. Thus I is transitive, by †559. A relation
w is totally reflexive (see note to § 25) if every element bears w to itself; or,
equivalently (by †556), if I ⊂ w. Thus I is totally reflexive. A relation w is
reflexive, more generally, if

    (x)(y) w(x, y) v w(y, x) .⊃ w(x, x).

The term ‘commutative’ carries over in similar fashion; but the term ordinarily
used in connection with relations is rather ‘symmetrical.’ A relation w is
symmetrical if

    (x)(y) w(x, y) ≡ w(y, x);

or, equivalently, if w̆ = w. †557 affirms the symmetry of I.

We have attributed transitivity, reflexivity, etc., on the one hand to the
connective sign ‘=’, which is a two-place predicate, and on the other hand to the
corresponding relation x̂ŷ(x = y) or I. By a similar transfer the idempotence,
commutativity, etc., which have been attributed to the term connectives ‘∩’ and
‘∪’ (§ 33) might be attributed likewise to the corresponding functions λ_xy(x ∩ y)
and λ_xy(x ∪ y). In general, a function w of two arguments is idempotent,
commutative, or associative according as

    w‘(x, x) = x,
    w‘(x, y) = w‘(y, x),
    or w‘(w‘(x, y), z) = w‘(x, w‘(y, z))

for all elements x, y, and z; and correspondingly for distributivity.
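The conditions of transitivity, total reflexivity, reflexivity, and symmetry can be tested mechanically on a finite relation. The following is a minimal sketch, assuming a relation is represented as a Python set of ordered pairs over a hypothetical three-member stock of elements; the function names are illustrative only.

```python
def is_transitive(w):
    """w(x, y) and w(y, z) together imply w(x, z)."""
    return all((x, z) in w for (x, y1) in w for (y2, z) in w if y1 == y2)

def is_totally_reflexive(w, elements):
    """Every element bears w to itself."""
    return all((x, x) in w for x in elements)

def is_reflexive(w):
    """Whatever enters into w on either side bears w to itself."""
    return all((x, x) in w and (y, y) in w for (x, y) in w)

def is_symmetrical(w):
    """w(x, y) holds exactly when w(y, x) does."""
    return all((y, x) in w for (x, y) in w)

ELEMENTS = {1, 2, 3}                                  # hypothetical elements
identity = {(x, x) for x in ELEMENTS}                 # finite stand-in for I
less_than = {(x, y) for x in ELEMENTS for y in ELEMENTS if x < y}
```

On this model the identity relation satisfies all four conditions, while `less_than` is transitive but neither reflexive nor symmetrical.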

I is, we have seen, the identity relation x̂ŷ(x = y). We turn now
to Є, which is the membership relation x̂ŷ(x ε y).

D35. ‘Є’ for ‘x̂ŷ(x ε y)’.

‘Є(x,y)’ diverges from ‘x ε y’ in the same way in which ‘I(x,y)’
was seen to diverge from ‘x = y’. Since only elements can enter
into a relation, ‘Є(x,y)’ holds only where y is an element and
x ε y.

†564. (y)(x) Є(x,y) .≡. x ε y . y ε V

Proof. *433 (& D35)    Є(x,y) .≡. x ε y . x, y ε V
       †191, *123             ≡. x ε y . y ε V

But the elementhood clause is suppressible in the reflexive case.

†565. (x) Є(x,x) .≡. x ε x

Proof. †564            Є(x,x) .≡. x ε x . x ε V
       †191                   ≡. x ε x

Though ‘x ε ιy’ and ‘x = y’ are not equivalent (cf. †345),
‘Є(x, ιy)’ and ‘I(x,y)’ are.

†566. (y)(x) Є(x, ιy) ≡ I(x,y)

Proof. †359            ιy ε V                                   (1)
       †564            Є(x, ιy) .≡. x ε ιy [. 1]
       †554                     ≡ I(x,y)

Є“x is the class of all members of members of x.

†567. (x) Є“x = ŷ(∃z)(y ε z . z ε x)

Proof. †564, *188 (& D28)    Є“x = ŷ(∃z)(y ε z . z ε V . z ε x)
       †191, *188                = R567

Where x is the class of professional baseball clubs, e.g., Є“x
is the class of professional baseball players. If lines and planes
are conceived as classes of points, and x is the class of all lines
in a given plane, then Є“x is the plane itself.
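The image of a class under the membership relation amounts, for finite classes, to what is ordinarily called the union of the members. A minimal sketch follows, assuming classes are modelled as Python frozensets; the club and player names are placeholders invented for the example.

```python
def members_of_members(x):
    """Finite stand-in for the image of x under membership:
    everything that belongs to some member of x."""
    return frozenset(y for z in x for y in z)

# Hypothetical clubs, each a class of players.
club_a = frozenset({"player 1", "player 2"})
club_b = frozenset({"player 3"})
clubs = frozenset({club_a, club_b})

players = members_of_members(clubs)
```

The result contains the three players and none of the clubs, in keeping with the baseball illustration above.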




CHAPTER SIX 


NUMBER 




§43. Zero, One, Successor


TO SAY that the Apostles are pious is to attribute a property to 
each man among the Apostles; and to say that we are unfortunate 
is to attribute a property to each individual among us. To say 
that the Apostles are twelve, on the other hand, is to attribute a 
property rather to the class of Apostles; and to say that we are 
seven is to attribute a property to us as a class. These properties 
twelve and seven, symbolically 12 and 7, are properties of classes; 
or, in keeping with our custom of treating properties as classes (cf. 
§ 22), they may be construed as classes of classes. 7 is the class of 
all seven-member classes; 12 is the class of all dozens. In order to 
belong to 7 or 12 or anything else, a class must of course be an 
element; but this is no restriction, for we know that every class 
of seven or twelve or a million members is an element (cf. end of 
§35). 

What has been said of 7 and 12 holds equally of the rest of the
so-called natural numbers, viz. 0, 1, 2, 3, etc. Thus 3 is the class
of all three-member classes; 2 is the class of all two-member
classes; 1 is the class of all one-member classes; and 0 is the class
of the no-member class, i.e., ιΛ.

D36. ‘0’ for ‘ιΛ’.

Given any natural number, say 7, the next (viz. 8) is readily
defined; for a class belonging to 8 (an eight-member class) is
characterized by the fact that it comes to belong to 7 when a
member is dropped. A class x belongs to 8 if and only if the
removal of a member y from x leaves a class (x ∩ ῑȳ; cf. § 35)
which belongs to 7.

    8 = x̂(∃y)(y ε x . x ∩ ῑȳ ε 7).

In general, given any natural number z, the next is definable as

(1)    x̂(∃y)(y ε x . x ∩ ῑȳ ε z).

In terms of 0, accordingly, which is already at hand (D36), we can
define 1 as follows:

D37. ‘1’ for ‘x̂(∃y)(y ε x . x ∩ ῑȳ ε 0)’.

In terms of 1, in turn, we can define 2 in similar fashion:

D38. ‘2’ for ‘x̂(∃y)(y ε x . x ∩ ῑȳ ε 1)’.

Similarly we can define 3, 4, and so on, as far as we like.
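The pattern of D36 through D38 can be tried out on a small scale. The sketch below assumes a hypothetical universe of three individuals and takes the classes to be all subsets of it; this finite setting is an assumption of the example and not a feature of the system, in which V is not finite. The names `powerset`, `successor`, `UNIVERSE`, and `CLASSES` are illustrative.

```python
from itertools import combinations

def powerset(universe):
    """All subsets of the universe, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def successor(z, classes):
    """Classes x having some member y whose removal leaves a member of z,
    following the pattern of formula (1)."""
    return frozenset(x for x in classes if any(x - {y} in z for y in x))

UNIVERSE = {"a", "b", "c"}           # hypothetical three-individual universe
CLASSES = powerset(UNIVERSE)

zero = frozenset({frozenset()})      # D36: the class whose sole member is the null class
one = successor(zero, CLASSES)       # D37
two = successor(one, CLASSES)        # D38
```

In this model `one` comes out as the class of the three unit classes and `two` as the class of the three pairs.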

Whenever z is an element, (1) designates an element in turn.

†610. (z) z ε V .⊃ x̂(∃y)(y ε x . x ∩ ῑȳ ε z) ε V

Proof. Since ‘(∃y)(y ε x . (∃w)(w ε z . (x′)(x′ ε w .≡. x′ ε x . ~
(y′)(y′ ε x′ .≡. y′ ε y))))’ is stratified,¹

       *200            z ε V .⊃. x̂(∃y)(y ε V . y ε x . (∃w)(w ε V . w ε z . (x′)
                       (x′ ε V .⊃: x′ ε w .≡. x′ ε x . ~(y′)(y′ ε V .⊃: y′ ε x′ .≡. y′ ε y)))) ε V
       *100, *123            ⊃. x̂(∃y)(y ε V . y ε x . (∃w)((x′)(x′ ε V . x′ ε w .≡.
                       x′ ε V . x′ ε x . ~(y′)(y′ ε V . y′ ε x′ .≡. y′ ε V . y′ ε y)) . w ε V . w ε z)) ε V
       †191, *123 (& D10)    ⊃. x̂(∃y)(y ε x . (∃w)((x′)(x′ ε w .≡. x′ ε x .
                       x′ ≠ y) . w ε z)) ε V
       †362, *123 (& D10, 12)    ⊃. x̂(∃y)(y ε x . x ∩ ῑȳ ε z) ε V

It is now apparent that 0, 1, 2, etc. are elements.

†611. 0 ε V          Proof: †359 (& D36).

†612. 1 ε V          Proof. †610 (& D37)    [611 ⊃] 612

†613. 2 ε V          Proof. †610 (& D38)    [612 ⊃] 613

We can extend this series of theorems as far as we like. Each
natural number is an element.

The function whose application to any element z yields the class
expressed by (1), above, will be called the successor function, S.

D39.² ‘S’ for ‘λ_z x̂(∃y)(y ε x . x ∩ ῑȳ ε z)’.

¹ Note that the use of numerals in showing stratification does not involve any
circular presupposition of the arithmetic which we are beginning to construct;
for the numerals so used do not enter into the formulae themselves, but belong rather
to syntax (cf. Ch. VII) along with the words ‘assignment’, ‘matrix’, etc. The
case is similar with numerical subscripts on Greek letters. Any sort of counters
would indeed serve these purposes; the numerical character is not essential.

² This formulation of the successor function is due essentially to Frege, who is
likewise responsible for the identification of a natural number with the class of all
classes having that number of members. See Grundlagen, pp. 79f, 89f.

The successor of any element z is the class expressed in (1).

†614. (z) z ε V .⊃. S‘z = x̂(∃y)(y ε x . x ∩ ῑȳ ε z)

Proof. *100            [610 .] z ε V .⊃. z ε V . R610
       *541 (& D39)              ⊃ R614

In view then of the elementhood of 0, 1, etc., we have the
following theorems.

†615. 1 = S‘0        Proof. †614 (& D37)    [611 ⊃] 615

†616. 2 = S‘1        Proof. †614 (& D38)    [612 ⊃] 616

We can extend this series of theorems as far as we like.

Though the terminology ‘successor’ suggests interest in S‘z
only where z is a natural number, still the range of functionality
of S embraces not merely the natural numbers but all elements.

†617. rS = V

Proof. *249                    ẑR610 = V [.≡ (z)610]            (1)
       *543 (& D39), *224      617 [≡ 1]
x belongs to the successor of an element z just in case x deprived
of one of its members belongs to z.

†618. (z)(x) z ε V .⊃: x ε S‘z .≡ (∃y)(y ε x . x ∩ ῑȳ ε z)

Proof. †190            x ∩ ῑȳ ε z .⊃. x ∩ ῑȳ ε V
       †363                       ⊃. x ε V                      (1)
       *100, *163      (∃y)([1 ⊃:] y ε x . x ∩ ῑȳ ε z) ⊃. x ε V (2)
       *230            x ε x̂L2 .≡. x ε V . L2
       *100                    ≡. L2 [. 2]                      (3)
       †614            z ε V .⊃ R614
       *223                  ⊃. R618 [≡ 3]

Two corollaries follow.

†619. (z)(x) x ε S‘z .⊃ (∃y)(y ε x . x ∩ ῑȳ ε z)

Proof. †535            x ε S‘z .⊃. z ε V                        (1)
       *100            [1 . 618 .⊃] 619



†620. (z)(y)(x) y ε x . x ∩ ῑȳ ε z . z ε V .⊃. x ε S‘z

Proof. *135            y ε x . x ∩ ῑȳ ε z .⊃ RR618              (1)
       *100            [1 . 618 .⊃] 620

The null class is its own successor.

†621. S‘Λ = Λ

Proof. †192            x ∩ ῑȳ ε̃ Λ                               (1)
       *100            [1 ⊃] ~(y ε x . x ∩ ῑȳ ε Λ)              (2)
       †614 (& D8)     [211 ⊃.] S‘Λ = x̂~(y)2                    (3)
       *121 (& D16)    [(x)((y)2 ≡ 182) ⊃. 3 ≡] 621 ¹

0 has one and only one member, viz. Λ.

†622. Λ ε 0

Proof. †344 (& D36)    [211 ≡] 622

†623. (x) x ε 0 .≡. x = Λ

Proof. †345 (& D36)    x ε 0 .≡. [211 .] x = Λ

0 is not a successor.

†624. (x) 0 ≠ S‘x

Proof. †192            y ε̃ Λ                                    (1)
       *223            0 = S‘x .⊃: [622 ≡.] Λ ε S‘x
       †619                    ⊃ (∃y)(y ε Λ . Λ ∩ ῑȳ ε x)
       *156 (& D8)             ⊃. ~(y)1 . (∃y)(Λ ∩ ῑȳ ε x)      (2)
       *100            [2 . (y)1 .⊃] 624

1, the class of all one-member classes, has been expressed in two
ways in D37 and †615; and it receives four more expressions in the
course of the following theorems.

†625. 1 = x̂(∃y)(y ε x . x ⊂ ιy)

Proof. †623, *188 (& D37)    1 = x̂(∃y)(y ε x . x ∩ ῑȳ = Λ)
       †321, *188              = x̂(∃y)(y ε x . x ⊂ ι̿y̿)
       †275, *227              = R625

†626. 1 = x̂(∃y)(z)(z ε x .≡. z = y)

Proof. †350, *123      626 [≡ 625]

†627. 1 = x̂(∃y)(y ε V . x = ιy)

Proof. †349, *123      [626 ≡] 627


¹ This is a striking case of compound bracketing. Its step-by-step resolution
involves no new idea. First we may bracket out ‘(y)2 ≡’, then ‘(x)182 ⊃.’, then
‘3 ≡’; or we may follow any of five other orders.

I.e., 1 is the class of unit classes of elements. Also it is the
range of functionality of the membership relation.

†628. 1 = rЄ

Proof. †626                    1 = x̂(∃y)(z)(z = y .≡. z ε x)
       *245                      = x̂(∃y)(z)(z = y .≡. x ε x̂(z ε x))
       *230, *188                = x̂(∃y)(z)(z = y .≡. z ε x . x ε V)
       †564, *188 (& D32)        = rЄ

To have 1 member (or 0, or 2, etc.) is to be a member of 1 (or 0,
or 2, etc.). The following theorem says that if x has one member
then anything y will belong to x if and only if x is ιy.

†629. (y)(x) x ε 1 .⊃: y ε x .≡. x = ιy

Proof. †358            z ε V .⊃: y = z .≡. ιz = ιy                      (1)
       *100            [1 ⊃:] z ε V . ιz = ιy .≡. z ε V . y = z
       †345                                   ≡. y ε ιz                 (2)
       *223            x = ιz .⊃:. z ε V . x = ιy .≡. y ε x [:≡ 2]      (3)
       *100, *163      (∃z)([3 .] z ε V . x = ιz) ⊃ R629                (4)
       †627, *224      x ε 1 .≡. x ε x̂L4
       *230                  ≡. x ε V . L4                              (5)
       *100            [4 . 5 .⊃] 629



§44. Natural Numbers 

THE NATURAL numbers 0, 1, and 2 have been defined (D36-38),
and each further natural number is definable by a continuation of
the same method. Given any natural number z, the next is

    x̂(∃y)(y ε x . x ∩ ῑȳ ε z)

or more briefly S‘z (cf. †611-†616, § 43). But the class Nn of all
natural numbers has yet to be formally defined as a whole.
Roughly, the definition is of course this: Nn comprises 0, 1, 2, etc.,
i.e., 0, S‘0, S‘(S‘0), etc. The problem is to formalize the ‘etc.’;
and the key is provided by the notion of the ancestral.

S‘0 is that which bears the relation S to 0; S‘(S‘0) is that
which bears S to that which bears S to 0; and so on. To be a
natural number, thus, is to be 0 or to bear S to 0 or to bear S to
something which bears S to 0 or etc. In short, to be a natural
number is to bear *S to 0 (cf. § 39). Thus Nn is definable as the
class of all bearers of *S to 0; symbolically, *S“ι0 (cf. †478).

D40. ‘Nn’ for ‘(*S“ι0)’.

Nn is closed with respect to S.

†630. S“Nn ⊂ Nn          Proof: †520 (& D40).

I.e., successors of natural numbers are natural numbers.
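The idea of collecting 0, S‘0, S‘(S‘0), etc. can be imitated by iterating a successor operation until nothing new appears. The sketch below again assumes a hypothetical three-individual universe whose classes are its subsets, which is an assumption of the example only. In so small a model the iteration runs out of larger classes and ends in the null class, which then repeats (cf. †621); the text's later result that no natural number is empty depends on there being classes of every finite size. All names are illustrative.

```python
from itertools import combinations

def powerset(universe):
    """All subsets of the universe, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def successor(z, classes):
    """Classes that fall into z when some one member is removed."""
    return frozenset(x for x in classes if any(x - {y} in z for y in x))

def iterate_until_repeat(step, start):
    """start, step(start), step(step(start)), ... up to the first repetition."""
    chain = [start]
    while True:
        nxt = step(chain[-1])
        if nxt in chain:
            return chain
        chain.append(nxt)

CLASSES = powerset({"a", "b", "c"})       # hypothetical finite model
ZERO = frozenset({frozenset()})

chain = iterate_until_repeat(lambda z: successor(z, CLASSES), ZERO)
member_counts = [sorted({len(x) for x in z}) for z in chain]
```

Each entry of `member_counts` lists the sizes of the classes belonging to the corresponding stage; the last stage is empty because the model contains no four-member class.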


†631. (x) x ε Nn .⊃. S‘x ε Nn

Proof. †190            x ε Nn .⊃. x ε V
       †617, *224             ⊃. x ε rS
       †532                   ⊃ S(S‘x, x)                       (1)
       †467            S(S‘x, x) . x ε Nn .⊃. S‘x ε S“Nn        (2)
       *231 (& D21)    [630 ⊃:] S‘x ε S“Nn .⊃. S‘x ε Nn         (3)
       *100            [1 . 2 . 3 .⊃] 631

0, 1, etc. are natural numbers.

†632. 0 ε Nn

Proof. †511            *S(0,0) [≡ 611]                          (1)
       †479 (& D40)    632 [≡ 1]

†633. 1 ε Nn

Proof. †631            [632 ⊃.] S‘0 ε Nn                        (1)
       †615, *224      633 [≡ 1]

†634. 2 ε Nn          Proof similar, using †633 and †616.

Any class which is closed with respect to S and contains 0
contains every natural number.

†635. (x) S“x ⊂ x . 0 ε x .⊃. Nn ⊂ x

Proof. †510                    L635 . *S(y,0) .⊃. y ε x         (1)
       *100, *117              L635 ⊃ (y)([1 .] *S(y,0) .⊃. y ε x)
       †479, *123 (& D21, 40)       ⊃. Nn ⊂ x

The form of inference which we considered in connection with the
ancestral (§ 39) takes on a more familiar aspect when adapted to
the theory of natural numbers. From the premisses:

    (x)(x + x = 2 × x .⊃. (S‘x) + (S‘x) = 2 × (S‘x)),
    0 + 0 = 2 × 0,
    17 ε Nn,

e.g., we infer that 17 + 17 = 2 × 17. Or, incorporating the third
premiss rather into the conclusion and leaving 17 unspecified, we
infer from the first two of the above premisses that

    (y) y ε Nn .⊃. y + y = 2 × y.

Such inference, called mathematical induction, is afforded by the
following metatheorem.¹

*636. If ψ, φ′, and φ₀ are like φ except for containing free
occurrences respectively of ζ, ⌜S‘α⌝, and ‘0’ wherever φ contains free
occurrences of α, then

    ⊢ ⌜(α)(φ ⊃ φ′) . φ₀ . ζ ε Nn .⊃ ψ⌝.

Proof. Let β be new, and let χ be formed from φ by putting β
for all free occurrences of α.

       *545 (& D39)        ⊢ ⌜S(β,α) ⊃. β = S‘α⌝.                               (1)
       *100                ⊢ ⌜[1 :] β = S‘α .⊃. φ ⊃ χ :⊃: S(β,α) . φ .⊃ χ⌝.     (2)
       *101                ⊢ ⌜[(β)2 ⊃.] (β)L2 ⊃ (β)R2⌝.                         (3)
       *234a, *123 (& hp)  ⊢ ⌜φ ⊃ φ′ .⊃ (β)R2 [:≡ 3]⌝.                          (4)
       *101                ⊢ ⌜[(α)4 ⊃.] (α)(φ ⊃ φ′) ⊃ (α)(β)R2
       *119                                         ⊃ (β)(α)R2⌝.                (5)
       †479 (& D40)        ⊢ ⌜ζ ε Nn ≡ *S(ζ,0)⌝.                                (6)
       *100                ⊢ ⌜[5 . 6 .] L636 .⊃. (β)(α)R2 . *S(ζ,0) . φ₀
       *521 (& hp)                           ⊃ ψ⌝.

Suppose the three premisses in the above example are at hand as
theorems, numbered say †811, †812, †813. Then *636 gives:

    [811 . 812 . 813 .⊃.] 17 + 17 = 2 × 17

and more generally:

    (y) [811 . 812 .⊃:] y ε Nn .⊃. y + y = 2 × y.
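The shape of this inference can be compared with an induction carried out in a proof assistant. The following Lean 4 sketch is offered only as an analogy: it uses Lean's built-in `Nat` and its `omega` arithmetic tactic rather than the class Nn and the ancestral of the text, and the theorem name `add_self_eq_two_mul` is invented for the example.

```lean
-- Sketch only: Lean's built-in Nat stands in for Nn.
theorem add_self_eq_two_mul (n : Nat) : n + n = 2 * n := by
  induction n with
  | zero => rfl            -- base premiss: 0 + 0 = 2 × 0
  | succ k ih =>           -- ih : k + k = 2 * k, the inductive premiss
    omega                  -- closes (k + 1) + (k + 1) = 2 * (k + 1)

-- The particular case corresponding to the third premiss.
example : 17 + 17 = 2 * 17 := add_self_eq_two_mul 17
```

The two cases of the `induction` tactic correspond to the second and first premisses of the example respectively, and the final `example` to the conclusion drawn for 17.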

¹ Mathematical induction was used and explicitly recognized by Pascal in 1654
(vol. 3, p. 298) and Fermat in 1659 (vol. 2, pp. 431ff). But the principle of
mathematical induction retained the status of an ultimate arithmetical axiom until 1879,
when Frege defined the ancestral and by its means the class of natural numbers.

A stronger form of mathematical induction is provided by the
following metatheorem, which is related to *522 as *636 is related
to *521.

*637. If [etc. as in *636] then

    ⊢ ⌜(α)(α ε Nn . φ .⊃ φ′) . φ₀ . ζ ε Nn .⊃ ψ⌝.

Proof similar, from *522.

Illustrations of the use of *637 will appear in the proofs of
†638 and †641.

It may happen that a class x deprived of a certain member y
belongs to a class z, while x deprived of a different member w does not
belong to z. When z is a natural number, however, this will not
happen; if x deprived of one member belongs to a natural number
z (i.e., has z members) then x deprived of any different member
belongs likewise to z. If removal of Peter leaves eleven Apostles,
removal of John would have had the same effect. This is shown in
the following theorem.

†638. (z)(y)(x)(w) z ε Nn .⊃: y, w ε x . x ∩ ῑw̄ ε z .⊃. x ∩ ῑȳ ε z

Proof. *223            w = y .⊃: x ∩ ῑw̄ ε S‘z .≡. x ∩ ῑȳ ε S‘z                 (1)
       *233            (w)(x)R638 ⊃: y, x′ ε x ∩ ῑw̄ . (x ∩ ῑw̄) ∩ ῑx̄′ ε z .⊃.
                                     (x ∩ ῑw̄) ∩ ῑȳ ε z                         (2)
       *100, *163      (∃x′)([2 .] x′ ε x ∩ ῑw̄ . (x ∩ ῑw̄) ∩ ῑx̄′ ε z)
                             ⊃: (w)(x)R638 . y ε x ∩ ῑw̄ .⊃ RR2
       †362, *123            ⊃: (w)(x)R638 . y ε x . w ≠ y .⊃ RR2              (3)
       †362, *123      w ε x . w ≠ y . RR2 . z ε V .⊃. w ε x ∩ ῑȳ . RR2 . z ε V
       †286, *224            ⊃. w ε x ∩ ῑȳ . x ∩ (ῑw̄ ∩ ῑȳ) ε z . z ε V
       †286, *224            ⊃. w ε x ∩ ῑȳ . (x ∩ ῑȳ) ∩ ῑw̄ ε z . z ε V
       †620                  ⊃ RR1                                             (4)
       †619            LR1 ⊃ L3
       *100                ⊃: [3 . 4 .] (w)(x)R638 . y, w ε x . w ≠ y . z ε V .⊃ RR1   (5)
       †190            z ε Nn .⊃. z ε V
       *100                   ⊃: [5 . 1 .] (w)(x)R638 . y, w ε x . LR1 .⊃ RR1  (6)
       *100, *117      z ε Nn . (w)(x)R638 .⊃ (w)(x)([6 .] y, w ε x . LR1 .⊃ RR1)      (7)
       †620            w ε x . x ∩ ῑw̄ ε 0 [. 611] .⊃. x ε S‘0
       †615, *224            ⊃. x ε 1                                          (8)
       *100            [8 . 629 .] y, w ε x . x ∩ ῑw̄ ε 0 .⊃. x = ιy
       *223                  ⊃: [294 ≡.] x ∩ ῑȳ = Λ
       *223                  ⊃: x ∩ ῑȳ ε 0 [.≡ 622]                            (9)
       *637            [(z)7 . (w)(x)9 .⊃.] z ε Nn .⊃ (w)(x)R638
       *114                                        ⊃ R638

If z is a natural number and y is any member of x, then x will
have S‘z members if and only if x deprived of y has z members. If
on the other hand y is a member of x̄ (hence any element not
belonging to x), then x will have z members if and only if x with y
thrown in has S‘z members. Such is the import of the following
two theorems.

†639. (z)(y)(x) z ε Nn . y ε x .⊃: x ε S‘z .≡. x ∩ ῑȳ ε z

Proof. †190            z ε Nn .⊃. z ε V                                        (1)
       *100, *163      (∃w)([638 .] w ε x . x ∩ ῑw̄ ε z) ⊃: L639 .⊃. x ∩ ῑȳ ε z (2)
       †619            x ε S‘z .⊃ L2                                           (3)
       *100            [1 . 2 . 3 . 620 .⊃] 639

†640. (z)(y)(x) z ε Nn . y ε x̄ .⊃: x ε z .≡. x ∪ ιy ε S‘z

Proof. †344            y ε V .⊃. y ε ιy
       *100                  ⊃: y ε x .v. y ε ιy
       †271                  ⊃. y ε x ∪ ιy                                     (1)
       †639            z ε Nn . R1 .⊃: (x ∪ ιy) ∩ ῑȳ ε z .≡. x ∪ ιy ε S‘z      (2)
       †327            ιy ⊂ x̄ .⊃. x = (x ∪ ιy) ∩ ῑȳ
       *223                   ⊃:. z ε Nn . R1 .⊃ R640 [:≡ 2]                   (3)
       †342            y ε x̄ .≡. y ε V . ιy ⊂ x̄                                (4)
       *100            [1 . 3 . 4 .⊃] 640

If x and y are finite classes having the same number of members,
in other words if x and y belong to the same natural number, then
the following theorem shows that x coincides with y if included
in y.

†641. (z)(y)(x) z ε Nn .⊃: x, y ε z . x ⊂ y .⊃. x = y

Proof. †342            w ε x .⊃. w ε V . ιw ⊂ x
       *100                  ⊃. ιw ⊂ x
       †326                  ⊃. x = (x ∩ ῑw̄) ∪ ιw                              (1)
       *223            x ∩ ῑw̄ = y ∩ ῑw̄ .⊃:. [1 ≡:] w ε x .⊃. x = (y ∩ ῑw̄) ∪ ιw (2)
       *103            [(x)1 ⊃:] w ε y .⊃. y = (y ∩ ῑw̄) ∪ ιw
       *223                  ⊃:: L2 ⊃: w ε x .⊃. x = y [.:≡ 2]                 (3)
       †329            x ⊂ y .⊃. x ∩ ῑw̄ ⊂ y ∩ ῑw̄                               (4)
       *233            (x)(y)R641 ⊃: x ∩ ῑw̄, y ∩ ῑw̄ ε z . R4 .⊃ L2
       *100                  ⊃: [3 .⊃.] x ∩ ῑw̄, y ∩ ῑw̄ ε z . R4 . w ε x . w ε y .⊃. x = y   (5)
       †639            z ε Nn . w ε y .⊃: y ε S‘z .≡. y ∩ ῑw̄ ε z               (6)
       *110 (& D21)    x ⊂ y .⊃: w ε x .⊃. w ε y
       *100                  ⊃: [5 . 6 .] (x)(y)R641 . x ∩ ῑw̄ ε z . R4 .
                                w ε x . z ε Nn . y ε S‘z .⊃. x = y             (7)
       *100, *163      (∃w)([4 . 7 .] w ε x . x ∩ ῑw̄ ε z) ⊃:
                                (x)(y)R641 . z ε Nn . y ε S‘z . x ⊂ y .⊃. x = y   (8)
       †619            x ε S‘z .⊃ L8                                           (9)
       *100, *117      z ε Nn . (x)(y)R641 .⊃ (x)(y)([8 . 9 .⊃:] x, y ε S‘z . x ⊂ y .⊃. x = y)   (10)
       *100            x, y ε 0 . x ⊂ y .⊃. x, y ε 0
       †356 (& D36)          ⊃. x = y                                          (11)
       *637            [(z)10 . (x)(y)11 .⊃.] z ε Nn .⊃ (x)(y)R641
       *114                                          ⊃ R641

§45. Counter Sets 


WHEN WE augment a class x by throwing in the class itself as
additional member, we have x ∪ ιx. Thus where x is the class of
the twelve Apostles, x ∪ ιx has thirteen members: the twelve
Apostles and the class of them. Where y is this class x ∪ ιx of
thirteen, y ∪ ιy in turn has fourteen members: the thirteen
members of y and y itself. In this fashion we can generate an unending
series of classes, each having one more member than its
predecessor.

x ∪ ιx, which will be called the self-augment of x, is an element
whenever x is. This cannot be established by direct appeal to *200,
because the stratification condition is not fulfilled; but it can be
established indirectly, as in the first line of the following proof.
The converse also holds, as the rest of the proof shows.¹

¹ With help of †363 it is not difficult to prove, more generally, that
(y)(x) x, y ε V .≡. x ∪ y ε V.

†650. (x) x ε V .≡. x ∪ ιx ε V

Proof. †273            x ε V [. 359] .⊃. x ∪ ιx ε V             (1)
       *223            ιx = Λ .⊃: x ∪ ιx = x [.≡ 297]
       *223                   ⊃ 650                             (2)
       *100            [1 . 2 . 343 .⊃] 650

The self-augment function, whose value for any element x as
argument is x ∪ ιx, will be called Sa.

D41. ‘Sa’ for ‘λ_x(x ∪ ιx)’.

In view of †650 we have the following theorems.

†651. (x) x ε V .⊃. Sa‘x = x ∪ ιx

†652. rSa = V

Proofs like those of †614 and †617, using †650 instead of †610.

If x is the class of Apostles, then Sa‘x, Sa‘(Sa‘x), etc. are the
classes of thirteen members, fourteen members, etc. which were
considered above. But if instead of starting with that
twelve-member class we take x as the class of zero members, viz. Λ, then
Sa‘x is the class Λ ∪ ιΛ, i.e. ιΛ, having the one member Λ;
Sa‘(Sa‘x) is ιΛ ∪ ιιΛ, having the two members Λ and ιΛ;
Sa‘(Sa‘(Sa‘x)) has the three members Λ, ιΛ, and ιΛ ∪ ιιΛ; and so on.
The classes making up this series will be called counter sets.¹ Just
as the class Nn of natural numbers comprises 0, S‘0, S‘(S‘0),
etc., so the class Cs of counter sets comprises Λ, Sa‘Λ, Sa‘(Sa‘Λ),
etc.; and accordingly, just as Nn was definable as *S“ι0, so Cs
is definable as *Sa“ιΛ.

D42. ‘Cs’ for ‘(*Sa“ιΛ)’.
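The first few counter sets are easily generated. The following is a minimal sketch, assuming classes are modelled as Python frozensets; the function names `self_augment` and `counter_sets` are illustrative.

```python
def self_augment(x):
    """Finite stand-in for the self-augment of x: x together with x itself as member."""
    return x | frozenset({x})

def counter_sets(count):
    """The first `count` counter sets, beginning with the null class."""
    result, current = [], frozenset()
    for _ in range(count):
        result.append(current)
        current = self_augment(current)
    return result

first_five = counter_sets(5)
sizes = [len(x) for x in first_five]
```

In this model the n-th counter set has n members, each counter set is a member of the next, and none is a member of itself.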

The interesting feature of Cs is that it contains one member
from each of the natural numbers. The counter set Λ is a member
of 0, the counter set Sa‘Λ is a member of S‘0, the counter set
Sa‘(Sa‘Λ) is a member of S‘(S‘0), and so on (cf. †663). In the
counter sets we have a guarantee that for every natural number z,
no matter how high, there is a class of z members. This conclusion
will be drawn formally in †664; and from this basis it will be found
possible (§ 46) to proceed to a theorem establishing the infinitude
of V.

¹ So-called because they are to serve as counters in certain formal developments
rather analogous to the process of counting. The term was suggested, in a different
context, by Ravven. These sets are von Neumann’s natural numbers.

The following six theorems and metatheorems are exactly
analogous to six in § 44.

†653. Sa“Cs ⊂ Cs

†654. (x) x ε Cs .⊃. Sa‘x ε Cs

†655. Λ ε Cs

†656. (x) Sa“x ⊂ x . Λ ε x .⊃. Cs ⊂ x

*657. If ψ, φ′, and φ₀ are like φ except for containing free
occurrences respectively of ζ, ⌜Sa‘α⌝, and ‘Λ’ wherever φ contains free
occurrences of α, then

    ⊢ ⌜(α)(φ ⊃ φ′) . φ₀ . ζ ε Cs .⊃ ψ⌝.

*658. If [etc. as in *657] then

    ⊢ ⌜(α)(α ε Cs . φ .⊃ φ′) . φ₀ . ζ ε Cs .⊃ ψ⌝.

Proofs like those of †630-†632, †635-*637, using †652, †653,
and †211 instead of †617, †630, and †611.

Є is the relation which an element bears to any element of which
it is a member (cf. § 42). Hence its proper ancestral *Є | Є is
the relation of an element x to an element y such that x is a
member of y or a member of a member of y or etc. (cf. end of § 39).
Now the following theorem shows that no counter set bears
*Є | Є to itself; i.e., that no counter set is a member of a member
of ... a member of itself.

†659. (x) x ε Cs .⊃ ~(*Є | Є)(x,x)

Proof. †562            Є ∪ I ⊂ *Є                                              (1)
       †422            [1 ⊃.] (Є ∪ I)(y,x) ⊃ *Є(y,x)                           (2)
       *100            [650 . 344 :] y ε x .v. y ε ιx : R650 :⊃:
                             y ε x . x ε V .v. y ε ιx : x ε x .v. x ε ιx : R650   (3)
       †271, *123      y ε x ∪ ιx . R650 .⊃: y ε x . x ε V .v. y ε ιx :
                             x ε x ∪ ιx . R650 [.:≡ 3]                         (4)
       †564, *123      Є(y, x ∪ ιx) ⊃: Є(y,x) v. y ε ιx : Є(x, x ∪ ιx) [.:≡ 4]
       †554, *123            ⊃. Є(y,x) v I(y,x) . Є(x, x ∪ ιx)
       †424, *123            ⊃. (Є ∪ I)(y,x) . Є(x, x ∪ ιx)                    (5)
       †487            *Є(x ∪ ιx, y) . *Є(y,x) .⊃ (*Є | *Є)(x ∪ ιx, x)
       †517, *224            ⊃ *Є(x ∪ ιx, x)                                   (6)
       *100, *163      (∃y)([6 . 5 . 2 .] *Є(x ∪ ιx, y) . Є(y, x ∪ ιx)) ⊃. RR5 . R6   (7)
       †486, *123      (*Є | Є)(x ∪ ιx, x ∪ ιx) ⊃. RR5 . R6 [:≡ 7]
       †487                  ⊃ (Є | *Є)(x,x)
       †524, *224            ⊃ (*Є | Є)(x,x)                                   (8)
       *223            Sa‘x = x ∪ ιx .⊃: (*Є | Є)(Sa‘x, Sa‘x) ⊃ R8 [.≡ 8]      (9)
       †190            x ε Cs .⊃. x ε V                                        (10)
       *100            [10 . 651 . 9 .] x ε Cs . R659 .⊃ ~LR9                  (11)
       †564            Є(x,Λ) .≡. x ε Λ [. 211]                                (12)
       *100            [12 . 192 .⊃] ~(*Є(Λ,x) . Є(x,Λ))                       (13)
       *131            ~(∃x)(*Є(Λ,x) . Є(x,Λ)) [≡ (x)13]                       (14)
       †486, *123      ~(*Є | Є)(Λ,Λ) [≡ 14]                                   (15)
       *658            [(x)11 . 15 .⊃.] x ε Cs .⊃ R659


It follows that no counter set is a member of itself.

†660. (x) x ε Cs .⊃. x ε̃ x

Proof. †512            Є ⊂ *Є | Є                               (1)
       †422            [1 ⊃.] Є(x,x) ⊃ (*Є | Є)(x,x)            (2)
       *100            [2 . 659 . 565 .⊃] 660

I.e., Cs is included in Russell’s class.

†661. Cs ⊂ x̂(x ε̃ x)

Proof. *313            [(x)660 ≡] 661

Thus V is not a counter set.

†662. V ε̃ Cs

Proof. †660            V ε Cs .⊃. V ε̃ V                         (1)
       *100            [1 . 210 .⊃] 662

It is not always true that x w ix has one more member than x. 
If x is a member of itself, x w tx is simply x; for example, 
V u iV = V. If x is not an element, x w ix is again simply x 
(cf. f343). When x is a counter set, however, we may be sure that 
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its self-augment x w w (or Sa‘#) will have one more member than 
x. This is established in the following theorem. 


†663. (y)(x)  x ∈ Cs . y ∈ Nn .⊃: x ∈ y .≡. Sa‘x ∈ S‘y

Proof.  †190  x ∈ Cs .⊃. x ∈ V  (1)
*100  [1.660.651 .] x ∈ Cs .⊃. x ∈ V . x ∉ x . R651
*235 (& D20), *123  ⊃. x ∈ x̄ . R651  (2)
†640  y ∈ Nn . x ∈ x̄ .⊃: x ∈ y .≡. x ∪ ιx ∈ S‘y  (3)
*223  R651 ⊃:. y ∈ Nn . x ∈ x̄ .⊃ R663 [:≡ 3]  (4)
*100  [2.4 .⊃] 663


It can now be proved that every natural number has a counter 
set as member. 


†664. (y)  y ∈ Nn .⊃ (∃x)(x ∈ Cs . x ∈ y)

Proof.  *232  [655.622 .⊃] (∃x)(x ∈ Cs . x ∈ 0)  (1)
*100  [654.663 .] L663 . x ∈ y .⊃. R654 . Sa‘x ∈ S‘y
*232  ⊃ (∃x)(x ∈ Cs . x ∈ S‘y)  (2)
*100, *163  (∃x)([2 .] x ∈ Cs . x ∈ y) ⊃: y ∈ Nn .⊃ R2  (3)
*100  [3 .] y ∈ Nn . R664 .⊃ R2  (4)
*637  [(y)4 . 1 .] y ∈ Nn .⊃ R664


§46. Finite and Infinite 

THE FOREGOING theorem shows that for every natural number 
y, no matter how high, there is a class (in fact a counter set) having 
y members. No natural number is empty. 

†670. Λ ∉ Nn

Proof.  *100  [192 ⊃] ∼(x ∈ Cs . x ∈ Λ)  (1)
†664 (& D8)  Λ ∈ Nn .⊃ ∼(x)1  (2)
*100  [2 . (x)1 .⊃] 670

The classes belonging to the successive natural numbers run
larger and larger without limit, but each is finite. On the other
hand an infinite class belongs to no natural number whatever;
where x is an infinite class there is no natural number y such that
x has y members. The members of an infinite class are in this sense
literally innumerable.

If a class x were finite and yet not an element, then membership
of x in a natural number would be impossible despite the finitude
of x; for a non-element is incapable of membership. However, we
know there are no finite non-elements (see end of § 35). Membership
in a natural number thus provides a simple and adequate
formal definition of finitude. The class Fin of all finite classes is
the class of all members of natural numbers, i.e. 𝔈“Nn (cf. †567).

D43. ‘Fin’ for ‘(𝔈“Nn)’.

A class is finite if and only if it belongs to a natural number. 

†671. (x)  x ∈ Fin .≡ (∃y)(x ∈ y . y ∈ Nn)

Proof.  †567, *224 (& D43)  x ∈ Fin .≡. x ∈ x̂ R671
*230  ≡. x ∈ V . R671
*158  ≡ (∃y)(R191 . y ∈ Nn)
†191, *123  ≡ R671

A class (element or non-element) is infinite if and only if it does 
not belong to Fin; i.e., if and only if it belongs to no natural 
number. One such class is V. 


†672. (x)  ∼(V ∈ x . x ∈ Nn)

Proof.  †333  y ⊂ V  (1)
†641  x ∈ Nn .⊃: y, V ∈ x [. 1] .⊃. y = V  (2)
*223  y = V .⊃: y ∉ Cs [.≡ 662]  (3)
*100, *163  (∃y)([2.3 .] y ∈ Cs . y ∈ x) ⊃ 672  (4)
†664  x ∈ Nn .⊃ L4  (5)
*100  [4.5 .⊃] 672

†673. V ∉ Fin

Proof.  †671 (& D8)  V ∈ Fin .≡ ∼(x)672  (1)
*100  [1 . (x)672 .⊃] 673



The class V of all elements is thus infinite; in other words,
there are infinitely many elements.

Whereas all finite classes are elements, it is not true conversely
that all elements are finite; for V itself is an element (†210) and
yet infinite. The class of infinite elements is F̅i̅n̅ (the complement
of Fin); and V belongs to it.

†674. V ∈ F̅i̅n̅

Proof.  *235 (& D20)  674 [≡. 210.673]

From †673 we see that for every finite class x there is an element
not belonging to x; i.e., there is something belonging to x̄.

†675. (x)  x ∈ Fin .⊃ (∃y)(y ∈ x̄)

Proof.  *223  x = V .⊃: x ∉ Fin [.≡ 673]  (1)
*100  [1.335 .] x ∈ Fin .⊃ ∼(V ⊂ x)
*100, *123 (& D21, 8)  ⊃ (∃y)(y ∈ V . y ∉ x)
*230 (& D20), *123  ⊃ (∃y)(y ∈ x̄)

An important property of the successor function which was not
established in §§ 43-44 is as follows: if x and y are natural numbers
having the same successor then x = y. The successor of a natural
number x determines x uniquely. In fact, if w is the successor of
a natural number x then x is fixed in terms of w as

ŷ(∃z)(z ∈ ȳ . y ∪ ιz ∈ w).

This is established in the following theorem.

†676. (x)  x ∈ Nn .⊃. x = ŷ(∃z)(z ∈ ȳ . y ∪ ιz ∈ S‘x)


Proof.  *135  y ∈ x . x ∈ Nn .⊃ (∃x)(y ∈ x . x ∈ Nn)
†671  ⊃. y ∈ Fin
†675  ⊃ (∃z)(z ∈ ȳ)  (1)
*100, *117  x ∈ Nn .⊃ (y)([1 ⊃:] y ∈ x .≡. R1 . y ∈ x)
*121  ⊃: [189 ≡.] x = ŷ(R1 . y ∈ x)
*158, *123  ⊃. x = ŷ(∃z)(z ∈ ȳ . y ∈ x)  (2)
†640  x ∈ Nn . z ∈ ȳ .⊃: y ∈ x .≡. y ∪ ιz ∈ S‘x  (3)
*100, *117  x ∈ Nn .⊃ (y)(z)([3 ⊃:] z ∈ ȳ . y ∈ x .≡. z ∈ ȳ . RR3)
*121  ⊃. [2 ≡] 676  (4)
*100  [4 ⊃] 676



It is now easy to prove that if x and y are natural numbers
having the same successor then x = y.

†677. (y)(x)  x, y ∈ Nn . S‘x = S‘y .⊃. x = y

Proof.  †676  x ∈ Nn .⊃. x = ŵ(∃z)(z ∈ w̄ . w ∪ ιz ∈ S‘x)  (1)
†676  y ∈ Nn .⊃. y = ŵ(∃z)(z ∈ w̄ . w ∪ ιz ∈ S‘y)  (2)
*228  S‘x = S‘y . R1 . R2 .≡. S‘x = S‘y . x = RR2 . R2
*228  ≡. S‘x = S‘y . x = y . R2  (3)
*100  [1 . 2 . 3 .⊃] 677

This theorem, despite its elementary appearance, depends for its
truth upon the infinitude of V. For, suppose V had only a finite
number of members, say 93. Suppose, in other words, there
were only 93 elements. Then no class could have 94 or 95 members;
i.e., there would be no classes belonging to 94 or 95. Hence
94 and 95 would both be null; i.e.,

S‘93 = S‘94 = Λ.

Despite the identity of S‘93 and S‘94, however, 93 and 94 would
remain distinct; for 93 would have V as member while 94 would
have no members. Thus †677 would be violated.

Actually, however, V is infinite (†673) and no natural number
reduces to Λ (†670). Proof of †677 had to be deferred till after
proving these things. Note that the proof of †677 uses †676, whose
proof in turn uses †675, which is a corollary of †673.
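
The dependence on the infinitude of V can be seen in miniature. The
following Python sketch is only a toy model and not the system of the
text: it assumes a universe of three atoms instead of 93, takes classes
to be subsets of that universe, and takes the natural number k to be the
class of all k-membered classes. The names zero, successor, and numeral
are illustrative.

```python
def zero():
    """0 as the class whose sole member is the empty class."""
    return frozenset({frozenset()})


def successor(number, universe):
    """S'n: every class y ∪ {z} with y in n and z an element not in y."""
    return frozenset(y | {z} for y in number for z in universe if z not in y)


def numeral(k, universe):
    """Build the natural number k by k applications of successor to 0."""
    n = zero()
    for _ in range(k):
        n = successor(n, universe)
    return n


universe = frozenset({"a", "b", "c"})   # assumed finite universe of 3 elements
three = numeral(3, universe)            # its sole member is the whole universe
four = numeral(4, universe)             # empty: no class has 4 members

# 3 and 4 are distinct, yet their successors coincide (both are empty),
# so the analogue of the successor theorem fails in this finite model.
print(three != four, successor(three, universe) == successor(four, universe))
```

In this model 3 plays the part of 93 and 4 the part of 94: the two
numbers differ, while their successors are one and the same null class.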

§47. Powers of Relations 

The ancestral *x is, we have seen, the relation of any element z
to any element w such that z is w or bears x to w or bears x to
something which bears x to w or etc. The pairs composing the relation
*x are hence classifiable as follows. First, there are the pairs
z;w such that z and w are the same element. The relation comprising
these pairs is I. Second, there are the pairs z;w such that z
bears x to w. The relation comprising these pairs is ẋ (x itself, if x
is a relation). Third, there are the pairs z;w such that z bears x to
something which bears x to w. The relation comprising these is
the relative product x | x. Fourth, there are the pairs z;w such
that z bears x to something which bears x to something which bears
x to w. The relation comprising these is (x | x) | x. The fifth
relation of the series is ((x | x) | x) | x; and so on.¹

¹ The position of the parentheses is of course immaterial; cf. †491.

These relations are known as the relative powers of x, symbolically
x^0, x^1, x^2, x^3, etc. If x is the parent relation, then x^0 is I
(as always); x^1 is the parent relation again; x^2 is the grandparent
relation; x^3 is great-grandparent; and so on. Relative power is
not to be confused with the familiar notion of power which occurs
in arithmetic; the relative power stands rather to the relative
product as the arithmetical power stands to the arithmetical
product. The arithmetical notions of sum, product, and power
will be taken up in § 48; but the present notion of relative power
also has an arithmetical ingredient, in that the y of x^y is ordinarily
a number.

Where y is any natural number, x^y is the relation borne by the
first element z_0 to the last element z_y of any sequence z_0, z_1, . . . ,
z_y such that x(z_0, z_1), x(z_1, z_2), . . . , x(z_(y-1), z_y). Where x is
the father relation, e.g.,

x^4(Kenan, Methuselah);

for it is written that x(Kenan, Mahalaleel) and x(Mahalaleel,
Jared) and x(Jared, Enoch) and x(Enoch, Methuselah). But the
sequence z_0, z_1, . . . , z_y need not always be composed, like

Kenan, Mahalaleel, Jared, Enoch, Methuselah,

of distinct elements; where x is the brother relation, e.g., Romulus
will bear x^5 to Remus in view merely of the repetitive sequence

Romulus, Remus, Romulus, Remus, Romulus, Remus.
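
The two examples can be checked mechanically. The following Python
sketch is an illustration under stated assumptions, not part of the
formal development: relations are modeled as finite sets of ordered
pairs, the identity relation is confined to an assumed finite stock of
elements, and the power is built up by repeated relative product. The
names relative_product and relative_power are hypothetical.

```python
def relative_product(r, s):
    """r | s: pairs (z, w) such that z bears r to something which bears s to w."""
    return {(z, w) for (z, u) in r for (v, w) in s if u == v}


def relative_power(x, y, elements):
    """x^y, taking x^0 as identity and each further power as (previous power) | x."""
    result = {(e, e) for e in elements}   # identity on the assumed finite universe
    for _ in range(y):
        result = relative_product(result, x)
    return result


father = {("Kenan", "Mahalaleel"), ("Mahalaleel", "Jared"),
          ("Jared", "Enoch"), ("Enoch", "Methuselah")}
patriarchs = {p for pair in father for p in pair}
brother = {("Romulus", "Remus"), ("Remus", "Romulus")}
twins = {"Romulus", "Remus"}

# Four steps lead from Kenan to Methuselah.
print(("Kenan", "Methuselah") in relative_power(father, 4, patriarchs))
# The six-term repetitive sequence amounts to five steps of the brother relation.
print(("Romulus", "Remus") in relative_power(brother, 5, twins))
```

Both lines print True. An even number of brother steps would lead
Romulus back to himself, which is why the six-term sequence, with its
five steps, is the one that connects him with Remus.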

The intuitive description of x^y at the head of the foregoing paragraph
turns on numerical subscripts, which must now be eliminated
if we are to devise a formal definition of x^y. The business of
counting off, accomplished informally by the subscripts, can be
accomplished more formally by pairing the elements z_0, z_1, . . . , z_y
with the successive natural numbers from 0 to y in the fashion
z_0;0, z_1;1, . . . , z_y;y. The relation which any such pair z_i;i bears
to the next following pair z_(i+1);(i+1) will be called bx. This is a
relation of pairs, and hence a class of pairs of pairs; it is the
relation of any pair m;n to any pair m′;(S‘n) such that n is a
natural number and x(m,m′).

To say that z bears x^y to an element w, now, is to say that z;0
is connected with w;y by a bx-series of pairs; in other words, that
z;0 either is w;y (in which case y = 0) or else bears bx to w;y (in
which case y = 1) or else bears bx to something which bears bx to
w;y (in which case y = 2) or else .... In short, the power x^y is the
relation of any element z to any element w such that z;0 bears the
ancestral *bx to w;y.

The auxiliary relation bx admits readily of formal definition.

D44. ⌜bζ⌝ for
⌜α̂β̂(∃α′)(∃β′)(∃γ)(α = α′;γ . β = β′;(S‘γ) . γ ∈ Nn . ζ(α′, β′))⌝.

Finally the power x^y is ẑŵ(*bx(z;0, w;y)), as remarked.

D45. ⌜(ζ^η)⌝ for ⌜α̂β̂(*bζ(α;0, β;η))⌝.

Suppose, e.g., that x is the father relation. Then any father
paired with any natural number will bear bx to his child paired
with the next natural number. Thus in particular

bx(Kenan;0, Mahalaleel;1),
bx(Mahalaleel;1, Jared;2),
bx(Jared;2, Enoch;3),
bx(Enoch;3, Methuselah;4).

Accordingly

*bx(Kenan;0, Methuselah;4)

and hence

x^4(Kenan, Methuselah).
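
The counting device can be imitated directly. The sketch below is a
minimal Python illustration, assuming finite relations given as sets of
pairs; bx_step and bears_power are hypothetical names. It pairs elements
with counters and follows the bx-series outward from z;0, which is what
bearing the ancestral of bx to w;y comes to in the finite case.

```python
def bx_step(x, pair):
    """Pairs to which `pair` bears bx: advance along x and raise the counter by one."""
    m, n = pair
    return {(m_next, n + 1) for (m_prev, m_next) in x if m_prev == m}


def bears_power(x, y, z, w):
    """True if z;0 bears the ancestral of bx to w;y, i.e. if z bears x^y to w."""
    reached = {(z, 0)}
    frontier = {(z, 0)}
    for _ in range(y):       # pairs with counters beyond y cannot lead to w;y
        frontier = {q for p in frontier for q in bx_step(x, p)}
        reached |= frontier
    return (w, y) in reached


father = {("Kenan", "Mahalaleel"), ("Mahalaleel", "Jared"),
          ("Jared", "Enoch"), ("Enoch", "Methuselah")}

print(bears_power(father, 4, "Kenan", "Methuselah"))
```

Because each bx-step raises the counter by exactly one, the pair w;y can
be reached from z;0 only by a series of exactly y steps; the counter does
the work that the numerical subscripts did informally.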


Only the first two theorems make explicit mention of bx. 


†680. (z)(y)(x)(w)
w ∈ Nn .⊃: bx(y, z;(S‘w)) .≡ (∃y′)(y = y′;w . x(y′, z))


Proof.  †536  S‘w ∈ V  (1)
†536  S‘w′ ∈ V  (2)
†411  y′;w ∈ V  (3)
†411  z;(S‘w) ∈ V  (4)
*223  w = w′ .⊃: w ∈ Nn .≡. w′ ∈ Nn  (5)
*226  w = w′ .⊃. S‘w = S‘w′  (6)
†677  w, w′ ∈ Nn . S‘w = S‘w′ .⊃. w = w′  (7)
*100  [5.6.7 .] w ∈ Nn .⊃: w = w′ .≡. S‘w = S‘w′ . w′ ∈ Nn  (8)
†417  z′ ∈ V [. 2 . 1] .⊃: z;(S‘w) = z′;(S‘w′) .≡. z = z′ . S‘w = S‘w′  (9)
*434 (& D44)  LR680 ≡. y ∈ V [. 4] . (∃y′)(∃z′)(∃w′)(y = y′;w′ .
LR9 . w′ ∈ Nn . x(y′,z′))  (10)

*234b 

y = y'v» • x(y',z) .= (3z')(y = y'-,w . x{y',z') . z = z') 

*234b, *123 = (3s')(3K'')(y = y'W- x(y',z '). z = z'. w=w') (11) 

*121 [(y')ll 3:]y e V. @y')Rll .s.ye V. (3/)Lll 

*158 k (3/)(y« V.L11) 

*228, *123 = (3yO([3.]LH) (12) 

*100 (& D24), *117 
w e Nn .3 (y')(z')MP-9 J:] 

LR9 . w' t Nn . x(y', z') . = . x(y\ z') . z = z' . w = w ') 
*121 3:. [10 a=:] LR680 e.yeV. (3y') Rll 

*122 D.R680[=12] 

†681. (y)(x)  y ∈ V .⊃. x^y = ẑŵ(z = w . 0 = y .∨ (*bx | bx)(z;0, w;y))
Proof. f411 z;0eV (1) 

t563,*224 *bx(z;0, w;y) = (/w(bx | *bx))(z;0, w,y) 
f524,*224 = CM*b* | b*))(z;0, w;y) 

f424 =. 7(z;0, w,y) V (*bx | boc)(z;0, w,y) 

f552,*123 =: [1 „]z;0 = w\y .v (*b,it | bx)(z;0, w\y) (2) 

f4l7 y, z t V [. 6113 .3 : z;0 = w\y . = . z = w . 0 = y (3) 
*100,*117 

y « V .3 (z)(w)([2. 3 .D:] z, to { V . L2 . = . z, is e V. RR3 v RR2) 
*433 (& D45), *123 

3 {z){w){x > (z,w) = RR681(z,m>)) 

f447 3 R681 


The next two are the main theorems on relative powers. 


†682. (x)  x^0 = I

Proof. |624 

0 ^ S‘y' 

(1) 

f536 


S'y'e V 

(2) 

|416 


z e V [. 611. 2~] . z;0 = z';(S‘y') .3. 0 = 

S'y' (3) 

*100, 

*163 

(3y) (3z') (3y') ([ 1 .3 .] w = y ;y' . z;0 = z' 

;(SV). 



y' « Nn . x(y,z')) 3. 

z e V (4) 

*434 1 

(& D44) 

bx(zv, z;0) s. w, z;0 t V . L4 

(5) 

*100, 

*163 

(3a»)([5.4 .] *b# (y;0, w) . L5) 3. z i V 

(6) 

I486, 

*123 

(*b* | b#)(y;0, z;0) 3. z e V [:= 6] 

(7) 

tl82 


0 = 0 

(8) 
†681

C611 3.] *° = yz(y = z [. 8] .V L7) 


*100 (& D25), 3 

*188 = yz(z e V: y = z .V L7) 


*100, *188 

= yz([7 3.] zeV.y = z) 


f553, *188 (& D26) = I 


*437 

= / 


†683. (y)(x)  y ∈ Nn .⊃. x^(S‘y) = x^y | x


Proof. f624 

0 5 * S'y 

(1) 

f536 

S'y t V 

(2) 

*100 

[4 3] ~ (z = w . 0 = S J y) 

(3) 

f681 [2 3.] 

# Sy = vw(z = w . 0 = S‘y .V (*b* | b*)(z;0, w 

;(S'y))) 

*100, *188 

= zz£([3 3] (*bx | b*)(z;0, w;(S'y))) 


†486, *188

= z«'(3y')(*b*(z;0, y'). b*(y', w;(S'y))) 

(4) 

1680, *117 




y 6 Nn .3(z) {w)(y')(bx(y\ w;( S'y)) = (3z')(*(z',aO . y' = z';y)) 
*121 3: [4=.] 

>= 2w(3/)(*b*(z;0, y') . (3z')(*(z» . y' = *';y)) 
*158,*123 3. * s ‘ y = £0(3y')(3*')(*b*(*;O, y') . *(z». y' = z';y) 

*138,*123 3.x s ' y = m-(3z')(3y')( “ “ “ ) 

*234b,*123 3. * s ' y = ZK-(3z')(*b*(z;0, z';y) . x{z\ w)) 

*100 (& D25), *123 

D. x Sy = zib{z e V . (3z')(*b#(z;0, z';y). x(z',w))) 

*158, *123 3. x s ‘ y = £w(3z')(* « V . *b*(z;0, z';y) . *(z») 

*100 (& D24), *123 

3. x Sy = zA(3z')(z, z'eV. *b*(z;0, z';y) . *(z', w)) 

*433, *123 (& D45, 29) 

3 R683 

For each natural number y, the above two theorems explain x^y
step by step in terms finally of I, x, and relative product.

x^0 = I,

x^1 = x^0 | x = I | x (= x; cf. †559),

x^2 = x^1 | x = (I | x) | x (= x | x),

x^3 = x^2 | x = ((I | x) | x) | x (= x | x | x),

etc. Where y is not a natural number, x^y is uninteresting,
though of course still defined by D45.

It is next shown, by mathematical induction on the basis of †682
and †683, that the order of the relative product in †683 is
inessential.



†684. (y)(x)  y ∈ Nn .⊃. x | (x^y) = x^y | x

Proof.  *226  R684 ⊃. (x | (x^y)) | x = (x^y | x) | x
†491, *224  ⊃. x | (x^y | x) = (x^y | x) | x  (1)
*223  R683 ⊃:. R684 ⊃. x | (x^(S‘y)) = x^(S‘y) | x [:≡ 1]  (2)
*100  [683.2 .] y ∈ Nn . R684 .⊃ RR2  (3)
†559, *224  x | I = I | x [.≡ 560]  (4)
†682, *224  x | (x^0) = x^0 | x [.≡ 4]  (5)
*637  [(y)3 . 5 .] y ∈ Nn .⊃ R684


†685. (y)(x)  y ∈ Nn .⊃. x^(S‘y) = x | (x^y)

Proof.  *223  R684 ⊃. 685 [≡ 683]  (1)
*100  [684.1 .⊃] 685



The concluding theorem shows that if the range of a function x 
embraces all elements, so do the ranges of the relative powers of x. 


†686. (y)(x)  y ∈ Nn . rx = V .⊃. r(x^y) = V

Proof. f539 xx = V .3. r(*») C r(* | (*»)) (1) 

*223 rtf) = V .3:. [1 -:] Ll 3. V C r(* | (*>)) 

f335, *123 3: Ll 3. r(* i (*»)) = V (2) 

*223 R685 3::r(* y ) = V.3:Ll 3.r(x s ' y ) - V[.:-2] (3) 

*100, *117 

Ll 3 (y) ([685.3 .] y c Nn . r(**) = V .3. r(* sy ) = V) (4) 
f682, *224 t(*°) - V[.« 551] (5) 

*637 R4 [. 5] . y e Nn .3. r(ar*) = V (6) 

*100 [4.6 .3] 686 


Whitehead and Russell (vol. 3, *301) define the relative power by a different and
more complicated method. Between my version of relative power and theirs there
is incidentally a minor divergence analogous to that noted in connection with the
ancestral. This divergence shows itself only in the case of x^0, which is not I for
Whitehead and Russell but rather

ŷẑ(y = z . (∃w)(x(y, w) ∨ x(w, y))).

See the end of § 49 for two more theorems on relative powers.
§48. Arithmetical Sum, Product, Power

WHERE x and y are any natural numbers, their sum x + y is the
number obtained from x by y additions of 1. In other words,
x + y is the successor of the successor . . . (y times) of x. Thus
x + y is definable as that which bears the yth relative power of the
successor relation to x; symbolically S^y‘x.

D46. ⌜(ζ + η)⌝ for ⌜(S^η‘ζ)⌝.

The product x × y of natural numbers x and y, again, is the
number obtained from 0 by y additions of x. Thus x × y is definable
as that which bears the yth relative power of λ_z(x + z) to 0;
symbolically λ_z(x + z)^y‘0.

D47. ⌜(ζ × η)⌝ for ⌜(λ_α(ζ + α)^η‘0)⌝.

The power x ∧ y, again, is the number obtained from 1 by y multiplications
of x.¹ It is definable as that which bears the yth
relative power of λ_z(x × z) to 1.

D48. ⌜(ζ ∧ η)⌝ for ⌜(λ_α(ζ × α)^η‘1)⌝.

†683 and †682 were seen to explain x^y step by step in terms
finally of I, x, and relative product, where y is any natural number
(cf. § 47). Analogously the following two theorems explain x + y
step by step in terms finally of x and successor.

†690. (x)  x ∈ Nn .⊃. x + 0 = x

Proof.  †190  x ∈ Nn .⊃. x ∈ V
†550  ⊃. I‘x = x
†682, *224 (& D46)  ⊃. x + 0 = x

†691. (y)(x)  y ∈ Nn .⊃. x + (S‘y) = S‘(x + y)

¹ This notation, consisting of a radical sign inverted, is Peano’s. I have adopted
it in preference to the usual exponent notation for two reasons: to avoid confusion
with the notion of relative power, and to prepare the way for the attachment of
subscripts ‘s’ and ‘r’ in § 50. Beyond the scope of the present study these considerations
cease to be relevant, and reversion to the usual exponent notation
becomes possible.
Proof.  †190  x ∈ r(S^(S‘y)) .⊃. x ∈ V  (1)
†538 (& D46)  x ∈ r(S^y) .⊃. (S | (S^y))‘x = S‘(x + y)  (2)
†533 (& D46)  ∼L1 ⊃. x + (S‘y) = Λ  (3)
†533 (& D46)  ∼L2 ⊃. x + y = Λ
*223  ⊃: S‘(x + y) = Λ [.≡ 621]
*223  ⊃: ∼L1 ⊃ R691 [.≡ 3]  (4)
†686  y ∈ Nn [. 617] .⊃. r(S^y) = V
*223  ⊃: L1 ⊃ L2 [.≡ 1]  (5)
†685  y ∈ Nn .⊃. S^(S‘y) = S | (S^y)
*223 (& D46)  ⊃: L2 ⊃ R691 [.≡ 2]  (6)
*100  [4.5.6 .⊃] 691


Thus, where x is any natural number,

x + 0 = x,
x + 1 = S‘(x + 0) = S‘x,
x + 2 = S‘(x + 1) = S‘(S‘x),
x + 3 = S‘(x + 2) = S‘(S‘(S‘x)), etc.

Analogously the following two theorems explain x × y step by
step in terms finally of 0, x, and sum.

†692. (x)  x × 0 = 0

Proof.  †550  [611 .] I‘0 = 0  (1)
†682, *224 (& D47)  692 [≡ 1]

†693. (y)(x)  y ∈ Nn .⊃. x × (S‘y) = x + (x × y)

Proof.  †536 (& D46)  x + z ∈ V  (1)
†536 (& D46)  x + (x × y) ∈ V  (2)
†536 (& D47)  x × y ∈ V  (3)
*542  [3.2 .⊃.] λ_z(x + z)‘(x × y) = x + (x × y)  (4)
*193  [(z)1 ⊃.] V = ẑ1
*543  = r λ_z(x + z)  (5)
†686  y ∈ Nn [. 5] .⊃. r(λ_z(x + z)^y) = V
*223  ⊃: 0 ∈ r(λ_z(x + z)^y) [.≡ 611]
†538 (& D47)  ⊃. (λ_z(x + z) | (λ_z(x + z)^y))‘0 = L4
*223  ⊃: “ “ “ “ = R4 [.≡ 4]  (6)
†685  y ∈ Nn .⊃. λ_z(x + z)^(S‘y) = λ_z(x + z) | (λ_z(x + z)^y)
*223 (& D47)  ⊃. 693 [≡ 6]  (7)
*100  [7 ⊃] 693
Thus, where x is any natural number,

x × 0 = 0,
x × 1 = x + (x × 0) = x + 0 = x,
x × 2 = x + (x × 1) = x + x,
x × 3 = x + (x × 2) = x + (x + x), etc.

Analogously the following two theorems explain x ∧ y step by
step in terms finally of 1, x, and product.

†694. (x)  x ∧ 0 = 1

†695. (y)(x)  y ∈ Nn .⊃. x ∧ (S‘y) = x × (x ∧ y)

Proofs analogous to those of †692 and †693.

Thus, where x is any natural number,

x ∧ 0 = 1,
x ∧ 1 = x × (x ∧ 0) = x × 1 = x,
x ∧ 2 = x × (x ∧ 1) = x × x,
x ∧ 3 = x × (x ∧ 2) = x × (x × x),

etc.
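
The three definitions share one pattern: a function is iterated y times
and applied to a starting number. The Python sketch below is a hedged
illustration of that pattern, not a rendering of the formal system;
Python's built-in integers are assumed to stand in for the natural
numbers, and the helper names iterate, add, multiply, and power are
hypothetical.

```python
def iterate(f, y):
    """The y-th power of a function f: apply f y times (0 times is identity)."""
    def powered(argument):
        for _ in range(y):
            argument = f(argument)
        return argument
    return powered


def successor(n):
    return n + 1          # built-in integers stand in for the natural numbers


def add(x, y):            # D46: the y-th power of successor, applied to x
    return iterate(successor, y)(x)


def multiply(x, y):       # D47: the y-th power of (z -> x + z), applied to 0
    return iterate(lambda z: add(x, z), y)(0)


def power(x, y):          # D48: the y-th power of (z -> x * z), applied to 1
    return iterate(lambda z: multiply(x, z), y)(1)


print(add(3, 4), multiply(3, 4), power(3, 4))   # prints: 7 12 81
```

The recursion equations of the six theorems fall out of the iteration:
zero applications leave the starting number untouched, and one further
application contributes one further successor, summand, or factor.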

Where x and y are not natural numbers, x + y, x × y, and x ∧ y
are uninteresting, though of course still defined by D46-48.
So long as x and y are natural numbers, x + y, x × y, and x ∧ y are
natural numbers as well; this is established in the following three
theorems. These theorems, and subsequent ones likewise, are
proved from the above six with help of mathematical induction.

†696. (y)(x)  x, y ∈ Nn .⊃. x + y ∈ Nn

Proof.  †631  R696 ⊃. S‘(x + y) ∈ Nn  (1)
*223  R691 ⊃:. R696 ⊃. x + (S‘y) ∈ Nn [:≡ 1]  (2)
*100  [691.2 .] y ∈ Nn : x ∈ Nn .⊃ R696 :⊃: x ∈ Nn .⊃ RR2  (3)
*223  R690 ⊃: x + 0 ∈ Nn .≡. x ∈ Nn  (4)
*100  [690.4 .] x ∈ Nn .⊃. x + 0 ∈ Nn  (5)
*637  [(y)3 . 5 .] y ∈ Nn .⊃: x ∈ Nn .⊃ R696  (6)
*100  [6 ⊃] 696

†697. (y)(x)  x, y ∈ Nn .⊃. x × y ∈ Nn

Proof.  †696  x, x × y ∈ Nn .⊃. x + (x × y) ∈ Nn  (1)
*223  R693 ⊃:. L1 ⊃. x × (S‘y) ∈ Nn [:≡ 1]  (2)
*100, *117  x ∈ Nn .⊃ (y)([693.2 .] y ∈ Nn . x × y ∈ Nn .⊃ RR2)  (3)
†692, *224  x × 0 ∈ Nn [.≡ 632]  (4)
*637  R3 [. 4] . y ∈ Nn .⊃. x × y ∈ Nn  (5)
*100  [3.5 .⊃] 697

†698. (y)(x)  x, y ∈ Nn .⊃. x ∧ y ∈ Nn

Proof similar.


§49. Familiar Identities of Arithmetic 

THE FOLLOWING is the associative law of addition. 
†710. (z)(y)(x)  x, y, z ∈ Nn .⊃. (x + y) + z = x + (y + z)


Proof.  †690  R696 ⊃. (x + y) + 0 = x + y  (1)
†690  y ∈ Nn .⊃. y + 0 = y
*223  ⊃:. R696 ⊃. (x + y) + 0 = x + (y + 0) [:≡ 1]  (2)
*100  [696.2 .] L696 .⊃. (x + y) + 0 = x + (y + 0)  (3)
†691  z ∈ Nn .⊃. (x + y) + (S‘z) = S‘((x + y) + z)  (4)
*223  R710 ⊃:. [4 ≡:] L4 ⊃. LR4 = S‘(x + (y + z))  (5)
†696  y, z ∈ Nn .⊃. y + z ∈ Nn
†691  ⊃. x + (S‘(y + z)) = S‘(x + (y + z))
*223  ⊃:: R710 ⊃:. L4 ⊃. LR4 = x + (S‘(y + z)) [.:≡ 5]  (6)
†691  L4 ⊃. y + (S‘z) = S‘(y + z)
*223  ⊃:: L6 ⊃:. R710 ⊃: L4 ⊃. LR4 = x + (y + (S‘z)) [::≡ 6]  (7)
*100  [7 .] L4 . L696 ⊃ R710 .⊃: L696 ⊃. LR4 = x + (y + (S‘z))  (8)
*637  [(z)8 . 3 .] L4 .⊃. L696 ⊃ R710  (9)
*100  [9 ⊃] 710

The above theorem carries an antecedent which limits the values
of all the variables to natural numbers; and the same is true of the
eight ensuing theorems. Proofs of such theorems can be shortened
by agreeing simply to disregard clauses of the form ⌜ζ ∈ Nn⌝.
This course is justified by the fact that any clause ⌜ζ ∈ Nn⌝ which
would be required in the full proof of such a theorem is readily
derivable from the antecedent of the theorem with help of †631
and †696-†698 (or perhaps provided outright by †632 or †633).
Shortened in this fashion, the above proof assumes the following
form.

†690  (x + y) + 0 = x + y
†690, *227  = x + (y + 0)  (1)
*226  R710 ⊃. S‘((x + y) + z) = S‘(x + (y + z))
†691, *224  ⊃. (x + y) + (S‘z) = x + (S‘(y + z))
†691, *224  ⊃. (x + y) + (S‘z) = x + (y + (S‘z))  (2)
*637  [(z)2 . 1 .⊃] R710



The proofs of the next eight theorems are rendered in the same 
elliptical fashion; the reader can easily restore the full details on 
comparing the above two versions of the proof of f710. 


†711. (x)  x ∈ Nn .⊃. x × 1 = x

Proof.  †615, *227  x × 1 = x × (S‘0)
†693  = x + (x × 0)
†692, *227  = x + 0
†690  = x

†712. (y)(x)  x, y ∈ Nn .⊃. (S‘x) + y = S‘(x + y)

Proof.  †690  (S‘x) + 0 = S‘x
†690, *227  = S‘(x + 0)  (1)
*226  R712 ⊃. S‘((S‘x) + y) = S‘(S‘(x + y))
†691, *224  ⊃. (S‘x) + (S‘y) = S‘(x + (S‘y))  (2)
*637  [(y)2 . 1 .⊃] R712



The next is the commutative law of addition. 


†713. (y)(x)  x, y ∈ Nn .⊃. x + y = y + x

Proof.  *223  R713 ⊃: [R712 ≡.] (S‘x) + y = S‘(y + x)
†691, *224  ⊃. (S‘x) + y = y + (S‘x)  (1)
*231  [(y)1 ⊃.] x + 0 = 0 + x .⊃. (S‘x) + 0 = 0 + (S‘x)  (2)
†182  0 + 0 = 0 + 0  (3)
*637  [(x)2 . 3 .⊃.] y + 0 = 0 + y  (4)
†183  0 + y = y + 0 [≡ 4]  (5)
*637  [(x)1 . 5 .⊃] R713



The next is the law of distributivity of multiplication into 
addition. 


†714. (z)(y)(x)  x, y, z ∈ Nn .⊃. x × (y + z) = (x × y) + (x × z)

Proof.  †690, *227  x × (y + 0) = x × y
†690  = (x × y) + 0
†692, *227  = (x × y) + (x × 0)  (1)
†710  x + ((x × y) + (x × z)) = (x + (x × y)) + (x × z)
†713, *227  = ((x × y) + x) + (x × z)
†710  = (x × y) + (x + (x × z))  (2)
*223  R714 ⊃: x + (x × (y + z)) = R2 [.≡ 2]
†693, *224  ⊃. x × (S‘(y + z)) = (x × y) + (x × (S‘z))
†691, *224  ⊃. x × (y + (S‘z)) = (x × y) + (x × (S‘z))  (3)
*637  [(z)3 . 1 .⊃] R714


Next come the associative and commutative laws of multiplication.

†715. (z)(y)(x)  x, y, z ∈ Nn .⊃. (x × y) × z = x × (y × z)

Proof.  †692  (x × y) × 0 = 0
†692  = x × 0
†692, *227  = x × (y × 0)  (1)
†714  (x × y) + (x × (y × z)) = x × (y + (y × z))  (2)
*223  R715 ⊃: (x × y) + ((x × y) × z) = R2 [.≡ 2]
†693, *224  ⊃. (x × y) × (S‘z) = x × (y × (S‘z))  (3)
*637  [(z)3 . 1 .⊃] R715


†716. (y)(x)  x, y ∈ Nn .⊃. x × y = y × x

Proof.  †692  (S‘x) × 0 = 0
†692, †690  = (x × 0) + 0  (1)
†712  (S‘x) + ((x × y) + y) = S‘(x + ((x × y) + y))
†710, *227  = S‘((x + (x × y)) + y)
†691  = (x + (x × y)) + (S‘y)  (2)
*223  (S‘x) × y = (x × y) + y .⊃: (S‘x) + ((S‘x) × y) = R2 [.≡ 2]
†693, *224  ⊃. (S‘x) × (S‘y) = (x × (S‘y)) + (S‘y)  (3)
*637  [(y)3 . 1 .⊃.] (S‘x) × y = (x × y) + y  (4)
*223  R716 ⊃: [4 ≡.] (S‘x) × y = (y × x) + y
†713, *224  ⊃. (S‘x) × y = y + (y × x)
†693, *224  ⊃. (S‘x) × y = y × (S‘x)

Rest of proof like last five lines of proof of †713.

The next two are the familiar laws for breaking up exponents. 

†717. (z)(y)(x)  x, y, z ∈ Nn .⊃. x ∧ (y + z) = (x ∧ y) × (x ∧ z)

Proof.  †690, *227  x ∧ (y + 0) = x ∧ y
†711  = (x ∧ y) × 1
†694, *227  = (x ∧ y) × (x ∧ 0)  (1)
*226  R717 ⊃. (x ∧ (y + z)) × x = ((x ∧ y) × (x ∧ z)) × x
†715, *224  ⊃. (x ∧ (y + z)) × x = (x ∧ y) × ((x ∧ z) × x)
†716, *224  ⊃. x × (x ∧ (y + z)) = (x ∧ y) × (x × (x ∧ z))
†695, *224  ⊃. x ∧ (S‘(y + z)) = (x ∧ y) × (x ∧ (S‘z))
†691, *224  ⊃. x ∧ (y + (S‘z)) = (x ∧ y) × (x ∧ (S‘z))  (2)
*637  [(z)2 . 1 .⊃] R717

†718. (z)(y)(x)  x, y, z ∈ Nn .⊃. x ∧ (y × z) = (x ∧ y) ∧ z

Proof.  †692, *227  x ∧ (y × 0) = x ∧ 0
†694  = 1
†694  = (x ∧ y) ∧ 0  (1)
†693, *227  x ∧ (y × (S‘z)) = x ∧ (y + (y × z))
†717  = (x ∧ y) × (x ∧ (y × z))  (2)
*223  R718 ⊃: [2 ≡.] L2 = (x ∧ y) × ((x ∧ y) ∧ z)
†695, *224  ⊃. L2 = (x ∧ y) ∧ (S‘z)  (3)
*637  [(z)3 . 1 .⊃] R718

Note that laws analogous to the above hold also for relative 
powers. 

†719. (z)(y)(x)  y, z ∈ Nn .⊃. x^(y + z) = (x^y) | (x^z)

†720. (z)(y)(x)  y, z ∈ Nn .⊃. x^(y × z) = (x^y)^z

Proofs similar, using †560, †682, †491, †684, †683, and †719
instead respectively of †711, †694, †715, †716, †695, and †717.

Further arithmetical notions which are readily defined are the 
notions of greater and less. 

⌜(ζ ≥ η)⌝ for ⌜(η ∈ Nn . *S(ζ, η))⌝.
⌜(ζ > η)⌝ for ⌜(ζ ≥ η . ζ ≠ η)⌝.
⌜(ζ ≤ η)⌝ for ⌜(η ≥ ζ)⌝.
⌜(ζ < η)⌝ for ⌜(η > ζ)⌝.

If we want to set up the general notation of Arabic numerals in
systematic fashion, we first introduce the digits ‘0’, . . . , ‘9’ in
the manner begun in D36-38; then we define xy or x⌢y (not to be
confused with x × y) in such a way that, where x and y are any
natural numbers, x⌢y proves to be the number whose designating
numeral under the usual Arabic notation consists of that of x
followed by that of y. A definition to this purpose is as follows:

⌜ζη⌝ or ⌜(ζ⌢η)⌝ for
⌜(η + (ζ × ((S‘9) ∧ (ια)(β)((S‘9) ∧ β > η . β > 0 .≡. β ≥ α))))⌝.

The operation so defined is associative except where ‘0’ appears in
middle position; e.g. (58)3 and 5(83) are both 583, and (50)3 is
503, but 5(03) is rather 53 (03 being 3). But we may dispense with
further derivation of theorems; the preceding developments
illustrate the foundations of the arithmetic of natural numbers
sufficiently for present purposes.
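
The behavior of the numeral operation, including the exception for a
middle ‘0’, can be tried out in a few lines. The Python sketch below
assumes one reading of the definition: the result is y plus x times ten
to the power α, where α is the least positive exponent such that ten to
that power exceeds y. The function name concatenate is hypothetical.

```python
def concatenate(x, y):
    """The number whose numeral is that of x followed by that of y."""
    alpha = 1                      # least exponent beta > 0 with 10**beta > y
    while 10 ** alpha <= y:
        alpha += 1
    return y + x * 10 ** alpha


# Associative in the ordinary case: (58)3 and 5(83).
print(concatenate(concatenate(5, 8), 3), concatenate(5, concatenate(8, 3)))
# Not associative with '0' in middle position: (50)3 against 5(03).
print(concatenate(concatenate(5, 0), 3), concatenate(5, concatenate(0, 3)))
```

The first line prints 583 twice; the second prints 503 and 53, since
the inner operation on 0 and 3 yields simply 3 and the leading zero
is lost.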

Proofs of †710-†718 which proceed by mathematical induction and strongly
resemble the above elliptical proofs in outward appearance have been usual since
Grassmann (1861).


§ 50. Ratios 

IN THIS section and the next two the derivation of further notions 
of quantitative mathematics will be outlined. The presentation 
will be sketchy, and progressively sketchier as it proceeds. No 
more theorems will be proved. 

Let us first take up the notion of ratio. One natural number is
said to stand in the ratio x/y to another natural number when the
one number multiplied by y equals the other multiplied by x. The
number 8, e.g., stands in the ratio 2/3 to 12; and 12 stands in the
ratio 3/2 to 8. In general, x/y is the relation of any natural number
z to any natural number w such that y × z = x × w.¹

⌜(ζ/η)⌝ for ⌜α̂β̂(α, β ∈ Nn . η × α = ζ × β)⌝.

Like the natural numbers which they relate, ratios themselves 
admit of an obvious ranking in point of greater and less. One 
ratio is greater than another if the number which bears the one 
ratio to a given number is greater than the number which bears the 
other ratio to the given number. To determine the relative rank
of two ratios x/y and x′/y′ we first find a natural number w belonging
to the converse domain of both ratios; i.e., a natural
number w such that some natural number z bears x/y to w and
some natural number z′ bears x′/y′ to w. Then we rank x/y as
greater or less than x′/y′ according as z is greater or less than z′.

¹ This version of ratios is essentially Peano’s (1901, pp. 54f).

A number w of the required kind is readily found; namely,
y × y′. There are natural numbers z and z′ which bear the respective
ratios x/y and x′/y′ to y × y′, namely the numbers
x × y′ and x′ × y. For, by definition, to say that x × y′ bears x/y
to y × y′ is to say merely that

y × (x × y′) = x × (y × y′);

and to say that x′ × y bears x′/y′ to y × y′ is to say merely that

y′ × (x′ × y) = x′ × (y × y′).

To judge whether x/y is greater or less than x′/y′ we have therefore
only to compare these particular numbers z and z′, namely x × y′
and x′ × y. The ratio x/y is greater than x′/y′ if and only if
x × y′ > x′ × y.
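
The account of ratios as relations, and the test for ranking them, can
be sketched as follows. This Python fragment is an illustration only;
it assumes built-in integers for the natural numbers and confines the
relation to pairs below an arbitrary bound, since the full relation is
infinite. The names bears_ratio, ratio_greater, and ratio_as_relation
are hypothetical.

```python
def bears_ratio(z, w, x, y):
    """True if z stands in the ratio x/y to w, i.e. if y * z == x * w."""
    return y * z == x * w


def ratio_greater(x, y, x2, y2):
    """True if x/y ranks above x2/y2: compare x * y2 with x2 * y."""
    return x * y2 > x2 * y


def ratio_as_relation(x, y, bound):
    """The pairs (z, w) below an assumed bound that stand in the ratio x/y."""
    return {(z, w) for z in range(bound) for w in range(bound)
            if bears_ratio(z, w, x, y)}


print(bears_ratio(8, 12, 2, 3), bears_ratio(12, 8, 3, 2))
print(ratio_greater(5, 2, 7, 3))        # 5/2 against 7/3: 15 against 14
print(ratio_as_relation(2, 3, 20) == ratio_as_relation(4, 6, 20))
```

All three lines print True. The last comparison shows why no reduction
to lowest terms is needed: 2/3 and 4/6 relate exactly the same pairs of
numbers, and so are one and the same relation.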

The analogue of ‘>’ for ratios will be rendered ‘ₛ>’;¹ thus

(x)(y)(x′)(y′)  x/y ₛ> x′/y′ .≡. x × y′ > x′ × y.

The following is the formal definition.

⌜(ζ ₛ> η)⌝ for
⌜(∃α)(∃β)(∃γ)(∃δ)(ζ = α/β . η = γ/δ . α × δ > γ × β)⌝.

The ratios 0/1, 1/1, 2/1, 3/1, etc. will be referred to briefly as
ₛ0, ₛ1, ₛ2, ₛ3, etc.; thus we define:

⌜ₛζ⌝ for ⌜(ζ/1)⌝.

All ratios of this sort are functions with the range of functionality
Nn. ₛ2 is the function “double of”, ₛ3 is “triple of”, and so on.
ₛ1 is simply

x̂ŷ(x ∈ Nn . x = y),

i.e., identity confined to natural numbers. ₛ0 is

x̂ŷ(x = 0 . y ∈ Nn),

i.e., the relation which 0 bears to every natural number; in other
words, the “constant function” whose value is 0 for all natural
numbers as arguments. All this is readily seen by checking back
over the definitions and making a few obvious transformations.

¹ The choice of the letter ‘s’ is arbitrary; ‘r’ would have been preferable, but it
is reserved for use in the arithmetic of real numbers (§ 51). Whitehead and Russell
afford a precedent for the use of ‘s’, in connection with ‘+’ and ‘×’ (vol. 3, *305).

The ratios ₛ0, ₛ1, ₛ2, etc. fall into the same order as do the
natural numbers 0, 1, 2, etc.; i.e.,

(x)(y)  ₛx ₛ> ₛy .≡. x > y.

But between each of these ratios ₛx and the next, viz. ₛ(S‘x), infinitely
many further ratios are packed which have no analogues
among the natural numbers; between ₛ2 and ₛ3, e.g., we have
5/2, 7/3, 8/3, 9/4, 11/4, etc. without end.

It is apparent from the definition of ⌜ζ/η⌝ that 1/0, or indeed
z/0 for any natural number z except 0, is simply the relation
x̂ŷ(x ∈ Nn . y = 0) which every natural number bears to 0; and that
0/0 is the relation x̂ŷ(x, y ∈ Nn) which holds between all natural
numbers. If we count these as ratios, then 1/0 will be the greatest
of all ratios, whereas 0/0 will fall nowhere in the less-to-greater
series; for

(x)(y)  x, y ∈ Nn . y ≠ 0 .⊃. 1 × y > x × 0,
i.e. (x)(y)  x, y ∈ Nn . y ≠ 0 .⊃. 1/0 ₛ> x/y,
and (x)(y)  ∼(0 × y > x × 0) . ∼(x × 0 > 0 × y),
i.e. (x)(y)  ∼(0/0 ₛ> x/y) . ∼(x/y ₛ> 0/0).

It is more usual, however, not to count 1/0 and 0/0 as ratios. The
class Ra of ratios is therefore defined thus:

‘Ra’ for ‘x̂(∃y)(∃z)(y ∈ Nn . z > 0 . x = y/z)’.

Pressing the analogy which began with the introduction of 
‘s> V‘ s 0’> ‘b 1’> etc., it is easy to define an analogue ‘ s +’ of ‘-f ’ 
and an analogue ‘ 5 x’ of ‘x’. If z and z' are natural numbers 
bearing the respective ratios x/y and x'/y' to a natural number w, 
then the rational sum {x/y) s + ( x'/y') is explained as the ratio 
which the natural sum z + z' bears to w. Suitable choices of z, z' 
and w have already been remarked; we have seen that x X y' and 
x' x y bear the respective ratios x/y and x'/y’ to y x y'. Hence 
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{x/y) s + (x'/y') is describable simply as the ratio of ( x X y') + 
(x' x y) to y x y'; in short, as 

((* X y') + (*' x y)) / (y x y')- 

The rational product $(x/y) \,{}_s{\times}\, (x'/y')$ is construed yet more 
simply, viz. as the ratio borne to y × y' by the product x × x' 
itself; in short, as 

$$(x \times x') \,/\, (y \times y').$$

The formal definitions are as follows. 

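$$\ulcorner (\zeta \,{}_s{+}\, \eta) \urcorner \text{ for } \ulcorner (\iota\gamma)(\exists\alpha)(\exists\beta)(\exists\alpha')(\exists\beta')(\zeta = \alpha/\beta \;.\; \eta = \alpha'/\beta' \;.\; \gamma = ((\alpha \times \beta') + (\alpha' \times \beta)) / (\beta \times \beta')) \urcorner.$$

$$\ulcorner (\zeta \,{}_s{\times}\, \eta) \urcorner \text{ for } \ulcorner (\iota\gamma)(\exists\alpha)(\exists\beta)(\exists\alpha')(\exists\beta')(\zeta = \alpha/\beta \;.\; \eta = \alpha'/\beta' \;.\; \gamma = (\alpha \times \alpha') / (\beta \times \beta')) \urcorner.$$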

It turns out that rational addition and multiplication as defined 
reproduce the familiar formal laws of natural addition and 
multiplication, and it turns out further that they yield parallel results 
numerically when applied to the analogues ${}_s0$, ${}_s1$, etc. of the 
natural numbers. ${}_s5 \,{}_s{+}\, {}_s7 = {}_s12$, e.g., and ${}_s5 \,{}_s{\times}\, {}_s7 = {}_s35$. In 
general, 

$$(x)(y)\quad {}_sx \,{}_s{+}\, {}_sy = {}_s(x + y),$$

$$(x)(y)\quad {}_sx \,{}_s{\times}\, {}_sy = {}_s(x \times y).$$

If we disregard the subscripts, we have in rational arithmetic 
simply a broader arithmetic of addition and multiplication in which 
the arithmetic of natural addition and multiplication is embedded 
as an integral part. 
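
The cross-multiplication rules for rational sum and product can be tried out mechanically. What follows is only a sketch in Python, on the assumption that the standard-library `Fraction` type may stand in for the ratios with nonzero denominator; the names `ratio_add`, `ratio_mul`, and `s` are illustrative and do not occur in the text.

```python
from fractions import Fraction

def ratio_add(x, y, x2, y2):
    """Sum of the ratios x/y and x2/y2: ((x*y2) + (x2*y)) / (y*y2)."""
    return Fraction(x * y2 + x2 * y, y * y2)

def ratio_mul(x, y, x2, y2):
    """Product of the ratios x/y and x2/y2: (x*x2) / (y*y2)."""
    return Fraction(x * x2, y * y2)

def s(n):
    """Rational analogue n/1 of the natural number n."""
    return Fraction(n, 1)

# The embedded whole numbers behave like the natural numbers themselves.
print(ratio_add(5, 1, 7, 1) == s(5 + 7))   # True
print(ratio_mul(5, 1, 7, 1) == s(5 * 7))   # True
print(ratio_add(1, 2, 1, 3))               # 5/6
```

The first two lines of output illustrate the embedding of natural arithmetic in rational arithmetic; the third shows a sum with no analogue among the natural numbers.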

One feature which puts rational arithmetic in marked contrast 
to natural arithmetic is the efficacy of division. The quotient 
x ÷ y could indeed have been defined in natural arithmetic, viz. 
as $(\iota z)(x = y \times z)$; but it would have been of little use, for there is 
no number whose product by y is x except in those relatively rare 
cases where x is a multiple of y. Where x and y are ratios, on the 
other hand, there is almost always one and only one ratio z such 
that $x = y \,{}_s{\times}\, z$; always, in fact, except where $y = {}_s0$. Thus it is 
that the definition: 

$$\ulcorner (\zeta \,{}_s{\div}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\zeta = \eta \,{}_s{\times}\, \alpha) \urcorner$$

of rational division is useful indeed. We find, in fact, that 
$x \,{}_s{\div}\, (w/w')$ is simply $x \,{}_s{\times}\, (w'/w)$. 
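
The contrast between the two kinds of division can be seen in a small sketch. It again assumes Python's `Fraction` as a stand-in for the ratios; `ratio_div` and `natural_quotients` are hypothetical helper names introduced here for illustration only.

```python
from fractions import Fraction

def ratio_div(x, y):
    """The ratio z with x == y * z; no such unique z exists when y is zero."""
    if y == 0:
        raise ZeroDivisionError("division by the zero ratio is undefined")
    # Dividing by w/w' is multiplying by w'/w.
    return x * Fraction(y.denominator, y.numerator)

def natural_quotients(x, y):
    """All natural numbers z with x == y * z (usually there are none)."""
    return [z for z in range(x + 1) if y * z == x]

x, y = Fraction(7), Fraction(2)
z = ratio_div(x, y)
print(z, y * z == x)            # 7/2 True
print(natural_quotients(7, 2))  # []
print(natural_quotients(6, 2))  # [3]
```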

Along with this gain in the matter of division, however, we 
encounter a loss in connection with the notion of power. Let us 
consider how a rational analogue ‘${}_s{\wedge}$’ of the ‘$\wedge$’ of natural 
arithmetic might appropriately be construed. The suitable version 
of $(x/y) \,{}_s{\wedge}\, {}_s2$, in particular, is obvious; just as $w \wedge 2$ is w × w, 
so we may appropriately take $(x/y) \,{}_s{\wedge}\, {}_s2$ as $(x/y) \,{}_s{\times}\, (x/y)$. 
Thus 

$$(x/y) \,{}_s{\wedge}\, {}_s2 = (x/y) \,{}_s{\times}\, (x/y) = (x \times x) / (y \times y) = (x \wedge 2) / (y \wedge 2).$$

Similarly 

$$(x/y) \,{}_s{\wedge}\, {}_s1 = (x \wedge 1) / (y \wedge 1) = x/y.$$

More generally, we may always explain $(x/y) \,{}_s{\wedge}\, {}_sx'$ as 
$(x \wedge x') / (y \wedge x')$. This accounts for all rational powers of the 
form $(x/y) \,{}_s{\wedge}\, (x'/1)$; but it remains to consider $(x/y) \,{}_s{\wedge}\, (x'/y')$ 
where $y' \neq 1$. Let us then examine the case $w \,{}_s{\wedge}\, (1/2)$, 
where w is any ratio. We shall want to preserve the familiar laws 
of exponents which hold in natural arithmetic; thus 

$$(w \,{}_s{\wedge}\, (1/2)) \,{}_s{\times}\, (w \,{}_s{\wedge}\, (1/2)) = w \,{}_s{\wedge}\, ((1/2) \,{}_s{+}\, (1/2)) = w \,{}_s{\wedge}\, {}_s1.$$

But, as just previously decided, $w \,{}_s{\wedge}\, {}_s1 = w$. Hence $w \,{}_s{\wedge}\, (1/2)$ 
must be explained as the ratio z such that $z \,{}_s{\times}\, z = w$. The 
difficulty is, however, that there will very commonly be no such ratio 
z. E.g., there is no ratio z such that $z \,{}_s{\times}\, z = {}_s3$; thus ${}_s3 \,{}_s{\wedge}\, (1/2)$ 
is missing. 

Whereas the notion of power in natural arithmetic works for 
any two natural numbers, the analogue for rational arithmetic 
fails in the observed fashion for many ratios. When the second 
ratio happens to be integral, i.e. of the form ${}_sx$, the power is indeed 
forthcoming; but if the second ratio is not integral, the power is 
missing more often than not. Rational arithmetic is fully as 
defective with respect to the notion of power as was natural 
arithmetic with respect to division. 


§51. Real Numbers 

THE MISSING powers in rational arithmetic can nevertheless be 
approximated by ratios, to within any preassigned interval of 
accuracy. Whereas e.g. there is no ratio ${}_s3 \,{}_s{\wedge}\, (1/2)$, in other 
words no ratio z such that $z \,{}_s{\times}\, z = {}_s3$, there are nevertheless ratios 
z such that $z \,{}_s{\times}\, z$ falls within any preassigned neighborhood of ${}_s3$. 
Thus, consider the class of all those ratios z which are small enough 
so that ${}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z$. All these ratios are smaller e.g. than 7/4; 
but within the class there is no largest. By choosing larger and 
larger ratios z from this class we can bring $z \,{}_s{\times}\, z$ closer and closer to 
${}_s3$, without end. 
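
That the class has no largest member can be illustrated by exhibiting, for each member, a larger one. The sketch below assumes Python's `Fraction` for the ratios; the particular step z ↦ (2z + 3)/(z + 2) is merely one convenient choice made here, not a construction taken from the text. Its square falls short of 3 by (3 − z²)/(z + 2)², so it stays in the class while exceeding z.

```python
from fractions import Fraction

def next_larger(z):
    """Given a ratio z with z*z < 3, return a larger ratio whose square is still < 3."""
    return (2 * z + 3) / (z + 2)

z = Fraction(1)
approximations = [z]
for _ in range(6):
    z = next_larger(z)
    approximations.append(z)

for a in approximations:
    # Each member is below 7/4, and its square creeps up toward 3 without reaching it.
    print(a, a < Fraction(7, 4), 3 - a * a)
```

Each line of output shows a member of the class, the fact that it is smaller than 7/4, and the shrinking but never vanishing gap between its square and 3.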

The class just now referred to has these four features. (1) Its 
members are ratios. (2) It does not exhaust the ratios. (3) It 
has no greatest member. (4) It contains all ratios except those 
which are too large; i.e., a ratio falls outside the class only if it 
exceeds all members of the class. These four features can be 
formulated together in the following compact fashion: the class is 
not the class of all ratios, and anything z belongs to the class if and 
only if $y \,{}_s{>}\, z$ for some y belonging to the class. A class of this 
kind is called (by caprice, we may for the moment suppose) a 
real number.¹ Thus the class Nr of real numbers is definable as 
follows: 

‘Nr’ for ‘$\hat{x}(x \neq \mathrm{Ra} \;.\; x = \hat{z}(\exists y)(y \in x \;.\; y \,{}_s{>}\, z))$’. 

The class of all ratios less than a given ratio is a real number; 
for it obviously has the four features noted above. Corresponding 
to each ratio x, thus, we have a real number $\hat{y}(x \,{}_s{>}\, y)$. The latter 
will be referred to briefly as ${}_rx$. 

$$\ulcorner {}_r\zeta \urcorner \text{ for } \ulcorner \hat{\alpha}(\zeta \,{}_s{>}\, \alpha) \urcorner.$$

1 The fact that such classes constitute a model of the traditional real number 
system was pointed out by Dedekind (Stetigkeit). The outright identification of 
the real numbers with those classes was first explicitly propounded by Russell 
(Principles, Ch. XXXIII), though rather hinted previously by Peano (“Sui 
numeri”). 

But we also have real numbers which do not thus correspond to 
ratios; one such is the real number originally considered above, 
viz. $\hat{z}({}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z)$. There is no ratio x such that this real number 
is ${}_rx$; in other words, no ratio x which is just next greater than all 
ratios belonging to this real number. If there were such a ratio, 
indeed, it would be our missing power ${}_s3 \,{}_s{\wedge}\, (1/2)$. Real numbers 
which correspond thus to no ratios are called irrationals. 

‘Irrat’ for ‘$\hat{x}(x \in \mathrm{Nr} \;.\; (y)(x \neq {}_ry))$’. 

The others may be called rational real numbers. 

An irrational is a real number which, so to speak, corresponds 
to a missing ratio. The real number $\hat{z}({}_s4 \,{}_s{>}\, z \,{}_s{\times}\, z)$ is a rational 
one, for it is ${}_rx$ where $x = {}_s2$; it corresponds to the ratio ${}_s2$. On the 
other hand the irrational $\hat{z}({}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z)$ fills in where the series of 
ratios exhibited only a lack: the lack of a rational power ${}_s3 \,{}_s{\wedge}\, (1/2)$. 

A general ranking in point of greater and less is readily imposed 
on real numbers, in such a way that any rational reals ${}_rx$ and ${}_ry$ 
will follow the same order as the corresponding ratios x and y. We 
have merely to reflect that ${}_rx$ will include ${}_ry$ if and only if x is 
greater than or equal to y. Thus ‘${}_r{\geq}$’ and ‘${}_r{>}$’, the analogues of 
‘≥’ and ‘>’ for real numbers, are defined as follows: 

$$\ulcorner (\zeta \,{}_r{\geq}\, \eta) \urcorner \text{ for } \ulcorner (\zeta, \eta \in \mathrm{Nr} \;.\; \eta \subset \zeta) \urcorner,$$

$$\ulcorner (\zeta \,{}_r{>}\, \eta) \urcorner \text{ for } \ulcorner (\zeta \,{}_r{\geq}\, \eta \;.\; \zeta \neq \eta) \urcorner.$$

The series of real numbers, arranged in point of greater and less 
in this sense, images the series of ratios except that new numbers, 
the irrationals, are squeezed in everywhere. Between the series of 
ratios z such that ${}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z$ and the series of ratios z such that 
$z \,{}_s{\times}\, z \,{}_s{>}\, {}_s3$, for example, there is no further ratio; but between the 
real numbers corresponding to the former ratios and the real 
numbers corresponding to the latter ratios there is an intermediate 
real number, viz. the irrational $\hat{z}({}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z)$. 

A ratio is called an upper bound of a given class of ratios if no 
member of the class exceeds it. A real number is called an upper 
bound of a class of real numbers under similar circumstances. Now 
a class of ratios may have an upper bound without having a least 
upper bound. This is true e.g. of the class $\hat{z}({}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z)$ hitherto 
discussed; 9/5 is one ratio which no member of the class exceeds, 
but 7/4 is a smaller ratio having the same property, and there are 
smaller and smaller ones without end. On the other hand every 
class of real numbers which has an upper bound has a least one. 
Consider e.g. the class analogous to the above class of ratios, 
viz. the class of all real numbers z such that ${}_{rs}3 \,{}_r{>}\, z \,{}_r{\times}\, z$; this 
class has the irrational number $\hat{z}({}_s3 \,{}_s{>}\, z \,{}_s{\times}\, z)$ itself as least upper 
bound. In general, indeed, it is readily verified that if x is a class 
of real numbers which has any upper bound at all then its least 
upper bound is simply the class Q“x of all members of members 
of x. 

The fact that every bounded class of reals has a least bound is 
the basic formal difference between the reals and the ratios. It is 
in this sense that the series of real numbers is said to be continuous 
while the series of ratios is not. On the other hand both series are 
dense, in the sense that there is a third number between any two. 

If a bounded class x of ratios lacks a least bound, then the class 
x' of those real numbers ${}_rz$ which correspond to the ratios z in x has 
an irrational least bound. Conversely, also, every irrational is the 
least bound of a class x' of the above kind. This is the strict 
meaning of the earlier loose remark to the effect that an irrational is a 
real number which corresponds to a missing ratio. 

The irrationals do far more than supply missing powers. One 
familiar irrational of another sort is π. The irrationals exist in 
such variety, indeed, that no notation whatever is capable of 
providing a separate name for each of them.¹ Note in contrast that 

1 The individual signs available in any given notation are finite in number, 
say $\mu_1, \mu_2, \ldots, \mu_k$. (In the present notation we have indeed an infinity of 
variables, but they are made up of the five signs ‘w’, ‘x’, ‘y’, ‘z’, ‘'’.) The infinite 
totality of expressions which can be constructed from $\mu_1, \ldots, \mu_k$ can be ordered 
thus: $\mu_1, \ldots, \mu_k, \ulcorner\mu_1\mu_1\urcorner, \ulcorner\mu_1\mu_2\urcorner, \ldots, \ulcorner\mu_k\mu_k\urcorner, \ulcorner\mu_1\mu_1\mu_1\urcorner, \ulcorner\mu_1\mu_1\mu_2\urcorner, \ldots$ 
Let us refer to them briefly as $\pi_1, \ldots, \pi_k, \pi_{k+1}, \ldots$ . Consider 
next the unending decimal expansion $\langle x \rangle$ of a positive real number x ≤ 1. 
Because unending, $\langle x \rangle$ must be thought of not as an expression but as a 
function, such that $\langle x \rangle\text{‘}n$ is the number ≤ 9 which turns up in nth place on 
expanding x. To say that $\langle x \rangle$ is unending is to say that for every number there is a larger, 
n, such that $\langle x \rangle\text{‘}n \neq 0$. We know that every positive real number ≤ 1 has a unique 
unending expansion (as well as sometimes an ending one; e.g., .43 = .42999 ...). 
Conversely, also, each such expansion determines a positive real number ≤ 1. Then 

any ratio can be named by putting ‘/’ between numerals. 

We turn now to the problem of defining multiplication and 
addition for real numbers. Our object will be to maintain the laws: 

(1)  ${}_rx \,{}_r{\times}\, {}_ry = {}_r(x \,{}_s{\times}\, y)$,  ${}_rx \,{}_r{+}\, {}_ry = {}_r(x \,{}_s{+}\, y)$ 

for all ratios x and y, just as in defining ‘${}_r{>}$’ our object was to 
maintain the law: 

$${}_rx \,{}_r{>}\, {}_ry \;.\equiv.\; x \,{}_s{>}\, y.$$

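Now clearly the following is true for all ratios (${}_s0$ and greater): 

(2)  $x \,{}_s{\times}\, y \,{}_s{>}\, z \;.\equiv\; (\exists x')(\exists y')(x \,{}_s{>}\, x' \;.\; y \,{}_s{>}\, y' \;.\; z = x' \,{}_s{\times}\, y')$. 

I.e., since $w \,{}_s{>}\, z \;.\equiv.\; z \in {}_rw$, 

$$z \in {}_r(x \,{}_s{\times}\, y) \;.\equiv\; (\exists x')(\exists y')(x' \in {}_rx \;.\; y' \in {}_ry \;.\; z = x' \,{}_s{\times}\, y').$$

In view of (1), then, we are to identify ${}_rx \,{}_r{\times}\, {}_ry$ with 

$$\hat{z}(\exists x')(\exists y')(x' \in {}_rx \;.\; y' \in {}_ry \;.\; z = x' \,{}_s{\times}\, y').$$

So, if we treat irrationals on a par with ${}_rx$ and ${}_ry$, 

$$\ulcorner (\zeta \,{}_r{\times}\, \eta) \urcorner \text{ for } \ulcorner \hat{\alpha}(\exists\beta)(\exists\gamma)(\beta \in \zeta \;.\; \gamma \in \eta \;.\; \alpha = \beta \,{}_s{\times}\, \gamma) \urcorner.$$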

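Addition is more complex, because the analogue of (2) for 
addition can fail when x or y is ${}_s0$. What holds in lieu of (2) is this: 

$$x \,{}_s{+}\, y \,{}_s{>}\, z \;.\equiv:\; x \,{}_s{>}\, z \;.\lor.\; y \,{}_s{>}\, z \;.\lor\; (\exists x')(\exists y')(x \,{}_s{>}\, x' \;.\; y \,{}_s{>}\, y' \;.\; z = x' \,{}_s{+}\, y').$$

I.e., 

$$z \in {}_r(x \,{}_s{+}\, y) \;.\equiv:\; z \in {}_rx \;.\lor.\; z \in {}_ry \;.\lor\; (\exists x')(\exists y')(x' \in {}_rx \;.\; y' \in {}_ry \;.\; z = x' \,{}_s{+}\, y').$$

Recalling (1), then, we are led to a definition of ‘${}_rx \,{}_r{+}\, {}_ry$’ which, 
if extended beyond ${}_rx$ and ${}_ry$ to real numbers generally, runs 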

consider the real number k which is determined as follows: for each n, $\langle k \rangle\text{‘}n$ is 1 
or 2 according as $\pi_n$ does or does not name a positive real number x ≤ 1 such that 
$\langle x \rangle\text{‘}n = 2$. Now k has no name $\pi_n$; for, if $\pi_n$ named k, $\langle k \rangle\text{‘}n$ would have to satisfy 
the self-contradictory condition of being 2 if and only if other than 2. It is thus 
established that no notation is adequate to expressing every real number. Since 
each rational number is readily expressed with help of the fractional notation, it 
follows that no notation is adequate to expressing every irrational. (This argument 
is due in essential respects to Cantor.) 

as follows: 

$$\ulcorner (\zeta \,{}_r{+}\, \eta) \urcorner \text{ for } \ulcorner (\zeta \cup \eta \cup \hat{\alpha}(\exists\beta)(\exists\gamma)(\beta \in \zeta \;.\; \gamma \in \eta \;.\; \alpha = \beta \,{}_s{+}\, \gamma)) \urcorner.$$
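
The reason why the sum needs the two extra clauses while the product does not can be checked on examples. The following is a sketch only, assuming Python's `Fraction` for the non-negative ratios; `product_witness` and `below_sum` are hypothetical names, and the midpoint choices inside them are just one way of finding the required x' and y'.

```python
from fractions import Fraction

def product_witness(x, y, z):
    """Return (x1, y1) with x1 < x, y1 < y and z == x1 * y1, or None if z >= x * y."""
    if not z < x * y:
        return None
    # x * y > z >= 0 forces x > 0 and y > 0, so the divisions below are safe.
    x1 = (z / y + x) / 2      # strictly between z/y and x
    y1 = z / x1               # hence strictly below y
    return x1, y1

def below_sum(x, y, z):
    """Right-hand side of the addition law: x > z, or y > z, or z splits as x1 + y1."""
    if x > z or y > z:
        return True
    if z < x + y:
        # Here x <= z and y <= z, so z - y >= 0 and a split below x and y exists.
        x1 = (z - y + x) / 2
        y1 = z - x1
        assert 0 <= x1 < x and 0 <= y1 < y and x1 + y1 == z
        return True
    return False

print(product_witness(Fraction(3), Fraction(2), Fraction(5)))
# With x = 0 no x1 < x exists, yet 1/2 is below 0 + 1: only the clause 'y > z' applies.
print(below_sum(Fraction(0), Fraction(1), Fraction(1, 2)))
```

On a grid of small ratios `below_sum(x, y, z)` agrees with `z < x + y`, which is what the displayed law for addition asserts.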

The definition of power $\ulcorner (\zeta \,{}_r{\wedge}\, \eta) \urcorner$ for real numbers will be passed 
over in this survey. 

It was observed that natural arithmetic is virtually, but not 
actually, embedded in rational arithmetic. The whole numbers in 
rational arithmetic are not 0, 1, 2, etc., but rather the rational 
analogues ${}_s0$, ${}_s1$, ${}_s2$, etc. Clearly the relation of rational to real 
arithmetic is similar; the rational reals are not ratios, but analogues 
${}_rx$ of ratios x. The whole real numbers, likewise, are not the 
natural numbers 0, 1, 2, etc.; they are rather real-number 
analogues ${}_{rs}0$, ${}_{rs}1$, ${}_{rs}2$, etc. of the rational analogues ${}_s0$, ${}_s1$, ${}_s2$, etc. of 
0, 1, 2, etc. (Note incidentally that ${}_{rs}0 = \Lambda$.) But when we are 
concerned with using arithmetic or investigating its higher reaches 
rather than examining its logical foundations, we of course find it 
convenient to leave these distinctions tacit and omit the subscripts. 


§52. Further Extensions 

FOR ALL REAL numbers x and y, there are real numbers $x \,{}_r{+}\, y$, 
$x \,{}_r{\times}\, y$, and $x \,{}_r{\wedge}\, y$; in this respect the arithmetic of real numbers 
is like that of natural numbers. For all real numbers x and y such 
that $y \neq {}_{rs}0$, moreover, there is a real number $x \,{}_r{\div}\, y$ whose 
product by y is x. 

$$\ulcorner (\zeta \,{}_r{\div}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\zeta = \eta \,{}_r{\times}\, \alpha) \urcorner.$$

In this respect the arithmetic of real numbers is like that of ratios. 
And the arithmetic of real numbers outdoes both that of natural 
numbers and that of ratios in the following respect: for all real 
numbers x and y such that $y \neq {}_{rs}0$ there is a yth root of x, 
symbolically $y \,{}_r{\surd}\, x$, i.e., a number whose yth power is x. 

$$\ulcorner (\zeta \,{}_r{\surd}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\eta = \alpha \,{}_r{\wedge}\, \zeta) \urcorner.$$

This notion of root, which is inverse to power just as quotient is 
inverse to product, admits equivalently of definition in terms of 
quotient and power: 

$$\ulcorner (\zeta \,{}_r{\surd}\, \eta) \urcorner \text{ for } \ulcorner (\eta \,{}_r{\wedge}\, ({}_{rs}1 \,{}_r{\div}\, \zeta)) \urcorner.$$

Since in the arithmetic of real numbers the notions of product 
and power thus have inverses which are almost always applicable 
(always save in the zero case), we are led to look also for a generally 
applicable notion of subtraction inverse to addition; but here the 
arithmetic of real numbers, like that of natural numbers and that of 
ratios, is found wanting. It is not true in general, given any real 
numbers x and y, that there is a real number $x \,{}_r{-}\, y$ whose addition 
to y yields x; there is such a number only if $x \,{}_r{\geq}\, y$. 

Just as the power defect in rational arithmetic was overcome by 
passing to real arithmetic, so the subtraction defect in real 
arithmetic is overcome by a further extension: introduction of negative 
numbers. Under this extension every real number x except ${}_{rs}0$ 
gives way to two numbers: a positive one ${}_{+}x$ and a negative one 
${}_{-}x$. Only ${}_{rs}0$ is left unsplit; ${}_{+rs}0$ and ${}_{-rs}0$ are identified. The 
branches ${}_{+}x$ and ${}_{-}x$ of a real number x are construed, further, as 
distinct from the branches ${}_{+}y$ and ${}_{-}y$ of any other real number y. 

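Entities ${}_{+}x$ and ${}_{-}x$ fulfilling the above requirements come readily 
to hand; we can take ${}_{+}x$ as $\lambda_y(x \,{}_r{+}\, y)$, i.e. as the function of 
addition of x in the sense of the foregoing arithmetic of real numbers, 
and we can take ${}_{-}x$ as the converse of that function. 

$$\ulcorner {}_{+}\zeta \urcorner \text{ for } \ulcorner \lambda_\alpha(\zeta \,{}_r{+}\, \alpha) \urcorner,$$

$$\ulcorner {}_{-}\zeta \urcorner \text{ for } \ulcorner \breve{\ }{}_{+}\zeta \urcorner.$$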

These new numbers, real numbers in a broader sense, may be called 
signed real numbers; real numbers in the previous narrower sense 
may then be distinguished from these as unsigned real numbers. 
The class NR of signed real numbers is definable in obvious 
fashion: 

‘NR’ for ‘$\hat{x}(\exists y)(y \in \mathrm{Nr} \;:\; x = {}_{+}y \;.\lor.\; x = {}_{-}y)$’.¹ 

The integers form a subclass of NR, defined thus: 

‘Int’ for ‘$\hat{x}(\exists y)(y \in \mathrm{Nn} \;:\; x = {}_{+rs}y \;.\lor.\; x = {}_{-rs}y)$’. 

1 This construction of the signed reals from the reals is modelled on Peano’s 
construction (1901, pp. 48-49) of the signed integers from the natural numbers. 
Peano urges that the essential idea was entertained more or less clearly by 
MacLaurin and Cauchy. 

Signed real numbers are ranked in point of greater and less in 
such a way that those of the form ${}_{+}x$ follow the same order as the 
corresponding unsigned reals, while the others follow the reverse 
order and are reckoned as smaller. Thus the series of signed real 
numbers constitutes an image of the unsigned series and a reverse 
image of the same laid end to end, with the zero as their point of 
contact. A definition which orders the signed real numbers in the 
described fashion is this: 

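$$\ulcorner (\zeta \,{}_R{>}\, \eta) \urcorner \text{ for } \ulcorner (\zeta, \eta \in \mathrm{NR} \;.\; (\exists\alpha)(\zeta\text{‘}\alpha \,{}_r{>}\, \eta\text{‘}\alpha)) \urcorner.$$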

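A signed real number x is called positive if $x \,{}_R{>}\, {}_{+rs}0$; negative 
if ${}_{+rs}0 \,{}_R{>}\, x$. So far as ${}_{+rs}0$ and the positive signed reals are 
concerned, this new arithmetic is to constitute an exact reproduction of 
the previous arithmetic of unsigned reals. Even throughout the 
negative signed reals, moreover, the notions of sum, product, and 
power are to conform to the same basic arithmetical laws. 
Definitions of sum and product which prove to meet these requirements 
are as follows: 

$$\ulcorner (\zeta \,{}_R{+}\, \eta) \urcorner \text{ for } \ulcorner ((\zeta \mid \eta) \cup (\eta \mid \zeta)) \urcorner,$$

$$\ulcorner (\zeta \,{}_R{\times}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\exists\beta)(\exists\gamma)(\zeta = {}_{+}\beta \;.\; \eta = {}_{+}\gamma \;.\lor.\; \zeta = {}_{-}\beta \;.\; \eta = {}_{-}\gamma \;:\; \alpha = {}_{+}(\beta \,{}_r{\times}\, \gamma) \;:\lor:\; \zeta = {}_{+}\beta \;.\; \eta = {}_{-}\gamma \;.\lor.\; \zeta = {}_{-}\beta \;.\; \eta = {}_{+}\gamma \;:\; \alpha = {}_{-}(\beta \,{}_r{\times}\, \gamma)) \urcorner.$$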
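
The construal of signed numbers as functions, and of the signed sum as a union of two compositions, can be modelled in a few lines. This is a sketch under loud assumptions: non-negative `Fraction` values stand in for the unsigned reals, a function that returns `None` stands for "no value at this argument", and the names `plus`, `minus`, `compose`, and `signed_sum` are invented here for illustration.

```python
from fractions import Fraction

def plus(x):
    """Positive branch of x: the function 'add x' on unsigned numbers."""
    return lambda a: a + x

def minus(x):
    """Negative branch of x: the converse of 'add x', defined only where a >= x."""
    return lambda a: a - x if a >= x else None

def compose(f, g):
    """Apply g, then f, wherever both steps are defined."""
    def h(a):
        b = g(a)
        return None if b is None else f(b)
    return h

def signed_sum(f, g):
    """Union of the two compositions of f and g (they agree wherever both are defined)."""
    def h(a):
        first = compose(f, g)(a)
        return first if first is not None else compose(g, f)(a)
    return h

SAMPLE = [Fraction(n, 2) for n in range(0, 21)]   # arguments 0, 1/2, ..., 10

def graph(f):
    """Values of f on the sample arguments, for comparing two functions."""
    return [f(a) for a in SAMPLE]

# (+2) + (-5) behaves exactly like -3 on every sampled argument.
print(graph(signed_sum(plus(2), minus(5))) == graph(minus(3)))   # True
```

Either composition alone would be too narrow: subtracting 5 first is defined only from 5 upward, whereas adding 2 first is defined from 3 upward, and it is the union that coincides with the negative branch of 3.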

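The general applicability of subtraction is achieved. For all 
signed real numbers x and y there proves to be a signed real number 
$x \,{}_R{-}\, y$, in the sense: 

$$\ulcorner (\zeta \,{}_R{-}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\zeta = \eta \,{}_R{+}\, \alpha) \urcorner.$$

It turns out, indeed, that $x \,{}_R{-}\, y$ is simply $x \,{}_R{+}\, {}_{-}z$ or $x \,{}_R{+}\, {}_{+}z$ 
according as y is ${}_{+}z$ or ${}_{-}z$. Division survives as in rational and 
unsigned real arithmetic; for all signed reals x and y such that 
$y \neq {}_{+rs}0$ there is a signed real $x \,{}_R{\div}\, y$ in the sense: 

$$\ulcorner (\zeta \,{}_R{\div}\, \eta) \urcorner \text{ for } \ulcorner (\iota\alpha)(\zeta = \eta \,{}_R{\times}\, \alpha) \urcorner.$$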

But the notion of power presents a difficulty. When we passed 
from natural numbers to ratios, we found that the gain with regard 
to division was offset by a loss with regard to the notion of power; 
and in signed real arithmetic the gain with regard to subtraction 
proves to be similarly offset. It turns out that if we define ‘${}_R{\wedge}$’ 
in the way in which we would have to define it in order to meet the 
requirements mentioned above, there will frequently be no such 
signed real number as $x \,{}_R{\wedge}\, y$ when x is negative and y is not an 
integer. 

This shortcoming proves to be attributable ultimately to the 
fact that there is no signed real number z such that $z \,{}_R{\times}\, z = {}_{-rs}1$. 
A still further arithmetic is thus prompted, the arithmetic of 
complex numbers, which does contain the analogue of such a number z. 
Complex numbers are construed simply as pairs x;y of signed real 
numbers; and the notions of sum, product, and power are defined 
in such a way that the arithmetic of signed real numbers is imaged 
within a part of the new arithmetic. In this arithmetic of complex 
numbers, as in that of unsigned reals, the notions of sum, product, 
and power all prove applicable without restriction; also, as in the 
arithmetic of unsigned reals, the inverse operations of division and 
extraction of root prove applicable save in the zero case; and also, 
as in the arithmetic of signed reals, subtraction applies without 
restriction. 

Analysis, which includes the differential and integral calculus, 
depends in its basic developments not upon the arithmetic of 
complex numbers but upon that of signed reals. The fundamental 
notion of analysis is that of the limit of a function. A (signed real) 
number z is said to be the limit of a function x for arguments 
approaching y from below, symbolically $\lim_{y} x$, if for every positive number z' 
there is a number y' which is less than y and such that the value of 
x differs from z by less than z' for any argument between y' and y. 
Rendered symbolically, this characterization of z assumes the form: 

$$(z')(z' > 0 \;.\supset\; (\exists y')(y > y' \;.\; (w)(y > w \;.\; w > y' \;.\supset.\; (x\text{‘}w) + z' > z \;.\; z + z' > x\text{‘}w))).$$

Thus 

$$\ulcorner \lim_{\eta} \zeta \urcorner \text{ for } \ulcorner (\iota\alpha)(\beta)(\beta > 0 \;.\supset\; (\exists\gamma)(\eta > \gamma \;.\; (\delta)(\eta > \delta \;.\; \delta > \gamma \;.\supset.\; (\zeta\text{‘}\delta) + \beta > \alpha \;.\; \alpha + \beta > \zeta\text{‘}\delta))) \urcorner.$$

The signs ‘>’, ‘+’, and ‘0’ here are of course intended as 
‘${}_R{>}$’, ‘${}_R{+}$’, and ‘${}_{+rs}0$’; but such subscripts are conveniently 
dropped in discourse which concerns only one kind of number. 

The limit of x for arguments approaching y from above, 
symbolically $\lim^{y} x$, and the so-called limit of x at infinity, 
symbolically $\lim_{\infty} x$, admit of analogous definitions. For the one we 
change ‘$\eta > \gamma$’ in the above definition to ‘$\gamma > \eta$’ and ‘$\eta > \delta \;.\; \delta > \gamma$’ 
to ‘$\gamma > \delta \;.\; \delta > \eta$’; for the other we simply drop ‘$\eta > \gamma$’ and 
‘$\eta > \delta$’. 

By way of concluding this sketchy survey of the route from logic 
into quantitative mathematics, the definition of the derivative Dx of 
a function x of one argument is now set down without discussion. 

$$\ulcorner D\zeta \urcorner \text{ for } \ulcorner \lambda_\alpha \lim_{\alpha} \lambda_\beta(((\zeta\text{‘}\beta) - (\zeta\text{‘}\alpha)) \div (\beta - \alpha)) \urcorner.$$

The derivative of the function “square of”, e.g., proves to be the 
function “double of”. 

$$D\lambda_x(x \wedge 2) = \lambda_x(2 \times x).$$

The derivative of any “constant function” (cf. § 41) proves to be 
the constant function whose uniform value is 0. 

$$(y)\quad D\lambda_x y = \lambda_x 0.$$
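
The claim about the function “square of” can be checked numerically by evaluating the difference quotient at arguments approaching a fixed point from below. The sketch assumes exact `Fraction` arithmetic in place of the signed reals; `difference_quotient` and `square` are illustrative names, and a finite list of step sizes naturally only suggests the limit rather than computing it.

```python
from fractions import Fraction

def difference_quotient(f, a, b):
    """((f(b) - f(a)) / (b - a)): the quantity whose limit defines the derivative at a."""
    return (f(b) - f(a)) / (b - a)

def square(x):
    return x * x

a = Fraction(3)
steps = [Fraction(1, 10**k) for k in range(1, 6)]   # 1/10, 1/100, ...

# Arguments b = a - h approach a from below; the quotient is exactly 2a - h.
quotients = [difference_quotient(square, a, a - h) for h in steps]
for h, q in zip(steps, quotients):
    print(h, q, 2 * a - q)
```

The last column is the distance of each quotient from 2 × 3; it equals the step h itself, and so falls below any preassigned positive number as the arguments approach 3.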

Abstract algebra is one branch of mathematics which lies aside from the direction 
of derivation sketched in the latter sections; and geometry would appear to be 
another. Actually the place where abstract algebra fits into the general structure 
of logic is farther back, namely within the general theory of relations and functions 
whose groundwork was studied in Chapter V. As for geometry, a method of 
reduction to logic is ready at hand in the simple expedient of identifying geometrical 
entities with those arithmetical entities with which they are correlated through 
analytic geometry (cf. Study, pp. 86-92). The more abstract, so-called 
non-metrical phases of geometry go over into correspondingly general phases of 
arithmetical analysis, so general, in the extreme case of topology, as to transcend the 
specifically arithmetical domain and form part of the general theory of relations 
and functions. This view of geometry seems to be the most convenient one in an 
account of the application of geometry to nature. A scale of measurement, whether 
of distance or of temperature, etc., may be viewed as a scheme for the systematic 
assignment of numbers directly to observed objects or events (cf. Carnap, 
Physikalische Begriffsbildung, and Jeffreys, Ch. VI); and then an empirico-geometrical 
statement, e.g. to the effect that Boston, Albany, and Buffalo are in line, may be 
analyzed as attributing a certain arithmetical relationship to the numbers which 
have been assigned to given objects under the scheme of distance measurement. 


§53. Formality¹ 


VARIABLES were described earlier (§ 12) as comprising just the 
letters ‘w’, ‘x’, ‘y’, and ‘z’ with or without accents. The atomic 
logical formulae were described in turn (§ 23) as comprising just the 
results of putting such variables in the blanks of ‘(  ϵ  )’. Then 
the logical formulae were described as comprising, first, the atomic 
logical formulae; second, all results of putting such expressions in 
the blanks of ‘(  ↓  )’ or after a parenthesized variable; third, 
all results of putting expressions of the thus supplemented totality 
in the blanks of ‘(  ↓  )’ or after a parenthesized variable; and 
so on. Now all these characterizations are formal, in that they 
speak only of the typographical constitution of the expressions in 
question and do not refer to the meanings of those expressions. 
But this explanation of ‘formal’ is vague; we turn now to a more 
precise version. 

Let us use ‘$S_1$’, ‘$S_2$’, ..., ‘$S_9$’ as names of the respective signs or 
typographic shapes ‘w’, ‘x’, ‘y’, ‘z’, ‘'’, ‘(’, ‘)’, ‘↓’, and ‘ϵ’; thus 

$S_1$ = ‘w’ = double-yu, 
$S_2$ = ‘x’ = ex, 
$S_3$ = ‘y’ = wye, 
$S_4$ = ‘z’ = zee = zed = izzard, 
$S_5$ = ‘'’ = accent, 
$S_6$ = ‘(’ = left parenthesis, 
$S_7$ = ‘)’ = right parenthesis, 
$S_8$ = ‘↓’ = down-arrow, 
$S_9$ = ‘ϵ’ = epsilon. 

Further, let us use the arch ‘$\frown$’ to indicate concatenation of 
expressions. Thus $S_1 \frown S_5$ is the expression ‘w'’ which is formed by writing 

1 This concluding chapter will be unintelligible to those readers in whom there 
is a lingering tendency to confuse use and mention of expressions (§ 4). I have not 
seen how to make the chapter less liable to misunderstanding except at the expense 
of a disproportionate increase in length. Inasmuch as the foregoing six chapters 
constitute a self-contained unit, it has seemed best to limit this seventh one to a 
modest appendage amputable at discretion. 

$S_1$ and then $S_5$; again $S_9 \frown S_2$ is ‘ϵx’; again $(S_1 \frown S_5) \frown (S_9 \frown S_2)$ is the 
result ‘w'ϵx’ of writing $S_1 \frown S_5$ followed by $S_9 \frown S_2$. But parentheses 
are not needed in connection with ‘$\frown$’, since concatenation is 
obviously associative; thus 

$$S_1 \frown S_5 \frown S_9 \frown S_2 = \text{‘w'ϵx’}.$$

The arch ‘$\frown$’ and the sign names ‘$S_1$’, ‘$S_2$’, ..., ‘$S_9$’ provide 
a notation for spelling; ‘$S_1 \frown S_5 \frown S_9 \frown S_2$’ may be read ‘the expression 
consisting of $S_1$ followed by $S_5$ followed by $S_9$ followed by $S_2$’, in 
other words ‘the expression consisting of double-yu followed by 
accent followed by epsilon followed by ex’. 
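
The spelling notation has a ready counterpart in string handling. As a sketch only, the nine signs can be modelled as one-character Python strings and the arch as string concatenation; the function name `arch` and the choice of the characters `↓` and `ϵ` for the eighth and ninth signs are assumptions of this illustration.

```python
# Names of the nine signs, modelled as one-character strings.
S1, S2, S3, S4 = "w", "x", "y", "z"
S5, S6, S7 = "'", "(", ")"
S8, S9 = "↓", "ϵ"

def arch(*parts):
    """Concatenation of expressions, written left to right."""
    return "".join(parts)

# 'double-yu followed by accent followed by epsilon followed by ex'
print(arch(S1, S5, S9, S2))

# Concatenation is associative, so the grouping of the parts is immaterial.
print(arch(arch(S1, S5), arch(S9, S2)) == arch(S1, arch(S5, S9), S2))   # True
```

Note that `S2` is a name of the letter ex, not the letter itself: the program mentions the sign by means of a variable, just as the text mentions it by means of the sign name.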

Into this rudimentary notation, plus the notation of logic, it is 
possible to translate ‘x is a variable’ in the sense verbally defined 
above; also ‘x is an atomic logical formula’; also ‘x is a logical 
formula’. First it is convenient to introduce a string of preliminary 
notions. An expression x constitutes an initial segment of y, or 
begins y, symbolically xBy, if y consists either of x alone or of x 
followed by something; thus 

$$(x)(y)\quad x\mathrm{B}y \;.\equiv:\; x = y \;.\lor\; (\exists z)(x \frown z = y).$$

Similarly x is a final segment of y, or ends y, symbolically xEy, if 
y consists either of x alone or of x preceded by something. 

$$(x)(y)\quad x\mathrm{E}y \;.\equiv:\; x = y \;.\lor\; (\exists z)(z \frown x = y).$$

Now x is a string of accents, symbolically Ac x, if every initial 
segment of x ends in an accent. 

$$(x)\quad \mathrm{Ac}\, x \;\equiv\; (y)(y\mathrm{B}x \;.\supset.\; S_5\mathrm{E}y).$$

The result of writing a left parenthesis followed by x followed by 
epsilon followed by y followed by a right parenthesis will be called 
xey. 

$$(x)(y)\quad x\mathrm{e}y = S_6 \frown x \frown S_9 \frown y \frown S_7.$$

When ‘↓’ is used instead of epsilon, the result is called xjy. The 
‘j’ is intended to suggest joint denial. 

$$(x)(y)\quad x\mathrm{j}y = S_6 \frown x \frown S_8 \frown y \frown S_7.$$

The result of putting an expression x in parentheses and prefixing it 
to y is called x qu y. The letters ‘qu’ are intended to suggest 
universal quantification. 

$$(x)(y)\quad x \;\mathrm{qu}\; y = S_6 \frown x \frown S_7 \frown y.$$

Now x is a variable, symbolically Vbl x, if x is any one of $S_1$, 
$S_2$, $S_3$, or $S_4$ with or without a string of accents attached. 

$$(x)\quad \mathrm{Vbl}\, x \;\equiv\; (\exists y)(y = S_1 \;.\lor.\; y = S_2 \;.\lor.\; y = S_3 \;.\lor.\; y = S_4 \;:\; x = y \;.\lor\; (\exists z)(\mathrm{Ac}\, z \;.\; x = y \frown z)).$$
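
The preliminary notions translate almost word for word into string predicates. The following is a sketch, assuming that expressions are non-empty Python strings over the nine signs; all function names (`begins`, `ends`, `is_accents`, `is_variable`, `eps`, `jd`, `qu`) are illustrative stand-ins for ‘B’, ‘E’, ‘Ac’, ‘Vbl’, ‘e’, ‘j’, and ‘qu’.

```python
S1, S2, S3, S4, S5, S6, S7, S8, S9 = "w", "x", "y", "z", "'", "(", ")", "↓", "ϵ"

def begins(x, y):
    """xBy: x is an initial segment of y."""
    return y.startswith(x)

def ends(x, y):
    """xEy: x is a final segment of y."""
    return y.endswith(x)

def is_accents(x):
    """Ac x: every initial segment of x ends in an accent (x taken as non-empty)."""
    segments = [x[:i] for i in range(1, len(x) + 1)]
    return len(x) > 0 and all(ends(S5, y) for y in segments)

def is_variable(x):
    """Vbl x: one of S1..S4, with or without a string of accents attached."""
    return any(
        x == y or (begins(y, x) and is_accents(x[len(y):]))
        for y in (S1, S2, S3, S4)
    )

def eps(x, y):
    """xey: left parenthesis, x, epsilon, y, right parenthesis."""
    return S6 + x + S9 + y + S7

def jd(x, y):
    """xjy: the same with the down-arrow of joint denial in place of epsilon."""
    return S6 + x + S8 + y + S7

def qu(x, y):
    """x qu y: x in parentheses, prefixed to y."""
    return S6 + x + S7 + y

print(is_variable("w''"), is_variable("xy"))   # True False
print(qu("x", jd(eps("x", "y"), eps("y", "x"))))
```

The last line spells out a quantification of a joint denial of two atomic formulae, built up purely by concatenation.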

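Further, where ‘$\mathrm{LFmla}_0\, x$’ is read ‘x is an atomic logical formula’, 

$$(x)\quad \mathrm{LFmla}_0\, x \;\equiv\; (\exists y)(\exists z)(\mathrm{Vbl}\, y \;.\; \mathrm{Vbl}\, z \;.\; x = y\mathrm{e}z).$$

If x is formed by prefixing a parenthesized variable to y, x is 
said to be a quantification of y, symbolically xQy. 

$$(x)(y)\quad x\mathrm{Q}y \;.\equiv\; (\exists z)(\mathrm{Vbl}\, z \;.\; x = z \;\mathrm{qu}\; y).$$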

Now the logical formulae are to comprise the atomic logical formulae 
together with all joint denials and quantifications thereof, and all 
joint denials and quantifications of the thus supplemented totality, 
and so on. By reasoning analogous to that which led to the 
definition of the ancestral (§ 39), then, we see that x is describable 
as a logical formula just in case it belongs to all classes y of the 
following kind: the atomic logical formulae belong to y, and the 
joint denials and quantifications of members of y belong to y. 
Thus 

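(1)  $(x)\quad \mathrm{LFmla}\, x \;\equiv\; (y)((z)(w)(\mathrm{LFmla}_0\, z \;.\supset.\; z \in y \;:\; z, w \in y \;.\supset.\; z\mathrm{j}w \in y \;:\; z \in y \;.\; w\mathrm{Q}z \;.\supset.\; w \in y) \;\supset.\; x \in y)$ 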

where ‘LFmla x’ is read ‘x is a logical formula’. 
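
Definition (1) characterizes the logical formulae as the members of every class closed under joint denial and quantification. For a working test of membership a different technique is convenient, namely a small recursive-descent recognizer; the sketch below uses that swapped-in technique, not the closure definition itself. It assumes the characters `ϵ` and `↓` for epsilon and the down-arrow, and the names `VARIABLE`, `parse_formula`, and `is_logical_formula` are invented for the illustration.

```python
import re

# A variable: one of the letters w, x, y, z followed by any number of accents.
VARIABLE = re.compile(r"[wxyz]'*")

def parse_formula(s, i=0):
    """Return the index just past one logical formula starting at s[i], or None."""
    if s[i:i + 1] != "(":
        return None
    m = VARIABLE.match(s, i + 1)
    if m:
        j = m.end()
        if s[j:j + 1] == ")":                 # quantification: (variable) formula
            return parse_formula(s, j + 1)
        if s[j:j + 1] == "ϵ":                 # atomic: (variable ϵ variable)
            m2 = VARIABLE.match(s, j + 1)
            if m2 and s[m2.end():m2.end() + 1] == ")":
                return m2.end() + 1
        return None
    left = parse_formula(s, i + 1)            # joint denial: (formula ↓ formula)
    if left is None or s[left:left + 1] != "↓":
        return None
    right = parse_formula(s, left + 1)
    if right is None or s[right:right + 1] != ")":
        return None
    return right + 1

def is_logical_formula(s):
    """True if the whole of s is exactly one logical formula."""
    return parse_formula(s) == len(s)

for candidate in ["(xϵy)", "(x')((xϵx')↓(x)(xϵy))", "(xϵ)", "xϵy"]:
    print(candidate, is_logical_formula(candidate))
```

The recognizer works because the notation is unambiguous: after a left parenthesis, a variable announces an atomic formula or a quantifier, while a further left parenthesis announces a joint denial.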

The so-called formal characterizations of ‘variable’, ‘atomic 
logical formula’, and ‘formula’ which appeared in words at the 
beginning of the section thus admit of translation into terms of 
‘$S_1$’, ‘$S_2$’, ..., ‘$S_9$’, ‘$\frown$’, and our logical notation. Whatever is 
“formal”, in the sense intended by the earlier loose phrase 
‘speaking only of the typographical constitution of the expressions in 
question’, admits of similar translation; though of course the 
sign-names ‘$S_1$’, ‘$S_2$’, ..., ‘$S_9$’ have to be reconstrued and perhaps 
altered in number to suit the particular notation under discussion. 
We are thus led to the following more rigorous criterion of 
formality: translatability into a notation containing only names of 
signs, a connective indicating concatenation, and the notation of 
logic. 

Discourse which is “formal” in this sense, and hence 
translatable into the notation just now described, is called metamathematics, 
formal syntax, or briefly syntax. Since joint denial, 
quantification, and the membership notation suffice for logic, these same 
three devices plus names of signs plus a notation indicating 
concatenation suffice for syntax. (A still more economical notation to 
the same purpose will appear in §54.) The fact that ‘Vbl’, 
‘$\mathrm{LFmla}_0$’, and ‘LFmla’ are definable in this syntactical notation 
is perhaps best expressed henceforward by speaking of them as 
syntactically definable; the word ‘formal’, having served the 
purpose of providing a first vague indication of our present concerns, 
can thus be left unencumbered with technical meaning. 

Any definition is stated with help of syntax, as a notational 
convention of abbreviation; but to say that an expression is 
syntactically definable is to say something more, namely that it is 
explicable as an abbreviation of an expression which is itself 
composed of just the notations of syntax. 

The general notion of formula was explained in § 13 in terms of 
an unspecified notion of atomic formula. Where ‘Fmla x’ means 
‘x is a formula’ and ‘$\mathrm{Fmla}_0\, x$’ means ‘x is an atomic formula’, 

$$(x)\quad \mathrm{Fmla}\, x \;\equiv\; (y)((z)(w)(\mathrm{Fmla}_0\, z \;.\supset.\; z \in y \;:\; z, w \in y \;.\supset.\; z\mathrm{j}w \in y \;:\; z \in y \;.\; w\mathrm{Q}z \;.\supset.\; w \in y) \;\supset.\; x \in y)$$

in exact analogy to (1). Thus ‘Fmla x’ is syntactically definable in 
terms of ‘$\mathrm{Fmla}_0$’; i.e., it is translatable into a notation comprising 
$\ulcorner \mathrm{Fmla}_0\, \alpha \urcorner$ and nothing further except names of signs, the 
concatenation connective, and logic. As will become apparent in the sequel, 
moreover, the notions of matrix, tautology, and theorem 
introduced in foregoing chapters have the same status as the notion of 
formula; they are all syntactically definable in terms of $\ulcorner \mathrm{Fmla}_0\, \alpha \urcorner$. 
If we specify the atomic formulae, so that $\ulcorner \mathrm{Fmla}_0\, \alpha \urcorner$ is itself 
syntactically defined in one way or another, then all these notions 
become syntactically definable in the absolute sense. If e.g. the 
atomic formulae are taken simply as the atomic logical formulae, 
then $\ulcorner \mathrm{Fmla}\, \alpha \urcorner$ reduces to $\ulcorner \mathrm{LFmla}\, \alpha \urcorner$, whose syntactical definability 
was observed in (1). 


§54. The Syntactical Primitive 

MUCH OF the discussion in previous chapters, including indeed all 
metatheorems, belongs to syntax; to logic or mathematics in the 
narrower sense belong rather the theorems themselves, some of 
which were stated and some of which were syntactically described. 
Our medium of syntactical discussion hitherto has been ordinary 
language, supplemented with the practical device of Greek letters 
and corners. But now we have a syntactical notation which is just 
as strict and systematic as the logical notation whereof it treats. 
It consists of the logical notation (the very notation whereof it
treats) plus ‘S₁’, ‘S₂’, . . . , ‘S₉’, and ‘⌢’.

Corners thus give way to a more analytic expedient, use of the 
concatenation sign; and Greek letters give way to the ordinary 
variables ‘x’, ‘y’, etc., adjuncts of the quantifier (cf. § 12). It is to
be noted that these variables do not need to be reconstrued, for 
present purposes, as having the sense which formerly attached to 
the Greek letters; rather they are general variables, as always, and 
may refer to any entities whatever — expressions included. The 
quantification ‘(x)(x = x)’, e.g., has always meant not only that
V = V and 5 = 5 but also that S₉ = S₉ (i.e., epsilon = epsilon) and
even that S₂ = S₂ (i.e., ‘x’ = ‘x’); and the quantifier does not need
to depart now from its usual sense. It happens indeed that the
only values of the variables which interest us in syntactical applications
are expressions, just as in § 39 the only values of the
variables which interested us were relations and in § 49 natural 
numbers. 

1 This kind of approach, whereby the medium of discourse about a formalism
receives strict formalization in turn, dates from Gödel (1931) and Tarski (1933).
The arch notation is Tarski’s.

But our new syntactical scheme is capable of improvement. It
is possible so to reformulate it that its primitive notation will comprise
no terms beyond variables; and there is an advantage, as observed
(§27), in so doing. The concatenation sign ‘⌢’, like ‘+’
etc., combines terms to form terms; e.g. it combines the
terms ‘S₉’ and ‘S₂’ (which are names of ‘ε’ and ‘x’) to form the
term ‘S₉⌢S₂’ (which is a name of ‘εx’). But instead of this we may
take as primitive a concatenation predicate ‘C’, or more accurately
a primitive form of atomic matrix ⌜Cαβγ⌝, such that ‘Cxyz’ means
‘x is the expression formed by writing the expression y followed by
the expression z’; then y⌢z becomes definable as (ιx) Cxyz. Again,
the primitive names ‘S₁’, ‘S₂’, . . . , ‘S₉’ are eliminable by adopting
a primitive form of atomic matrix ⌜αAβ⌝ such that ‘xAy’ means
‘x is the sign which alphabetically just succeeds the sign y’; for
then

S₁ = (ιx)((∃y)(yAx) . ~(∃y)(xAy)),

S₂ = (ιx) xAS₁, S₃ = (ιx) xAS₂, . . . , S₉ = (ιx) xAS₈,

where the alphabetic order of our signs is taken arbitrarily as
S₁, S₂, . . . , S₉.

But further economy is possible. We can get along with a single
primitive form of atomic matrix ⌜Mαβγ⌝, if we endow ‘Mxyz’ with
this elaborate meaning: if x is a single sign (Case 1), x alphabetically
just succeeds y (i.e., xAy); if x is a complex expression
(Case 2), x is the result of writing y followed by z (i.e., Cxyz); and
if x is not an expression at all (Case 3, uninteresting for syntax),
x = y. The word ‘sign’ in this explanation is of course understood
as referring exclusively to our chosen signs S₁, S₂, etc.; and ‘expression’
refers exclusively to such signs and finite rows of them.

What z may be is indifferent to Cases 1 and 3; hence if x is not
a complex expression, the above account of ‘Mxyz’ guarantees
that Mxyz ≡ Mxyx. Hence if on the contrary

(1) Mxyz . ~Mxyx

we may be sure that x is a complex expression. But then (1) must
be interpreted according to Case 2; i.e., as meaning that x is the
result of writing y followed by z, and not the result of writing y
followed by x. But the latter clause imposes no restriction at
all; since an expression is understood as a finite string of signs
(one or more), an expression x could not be the result of writing an
expression y followed by x again in its entirety. Hence (1) amounts
merely to saying that x is the result of writing y followed by z;
i.e., Cxyz.

We have seen that, where x is a complex expression, (y) ~Mxyx.
Where x is not an expression at all, Mxxx by Case 3. Hence if 

(2) Mxyx . ~ Mxxx 

we may be sure that x is an expression but not a complex expression; 
therefore a sign. But then (2) must be interpreted according to 
Case 1; i.e., as meaning that x alphabetically succeeds y and does 
not succeed itself. Since the latter clause is clearly vacuous, 
(2) amounts merely to saying that xAy. 
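The three cases of ‘M’, and the recovery of concatenation and alphabetic succession by (1) and (2), can be tried out mechanically. What follows is only a sketch under stated assumptions: the nine signs are represented by the ASCII characters of `ALPHABET` (with `|` standing in for the joint-denial sign and `e` for epsilon), expressions are non-empty Python strings over that alphabet, and anything else counts as a non-expression. The names `M`, `C`, `A`, `is_sign`, and `is_expression` are illustrative, not part of the notation.

```python
# Assumed ASCII stand-ins for the signs S1..S9, in alphabetic order.
ALPHABET = "wxyz'()|e"

def is_expression(x):
    """An expression is a finite row of one or more of the chosen signs."""
    return isinstance(x, str) and len(x) > 0 and all(c in ALPHABET for c in x)

def is_sign(x):
    return is_expression(x) and len(x) == 1

def M(x, y, z):
    if is_sign(x):
        # Case 1: x alphabetically just succeeds y; z is indifferent.
        return is_sign(y) and ALPHABET.index(x) == ALPHABET.index(y) + 1
    if is_expression(x):
        # Case 2: x is the result of writing y followed by z.
        return is_expression(y) and is_expression(z) and x == y + z
    # Case 3: x is not an expression at all; z is indifferent.
    return x == y

def C(x, y, z):
    """Concatenation recovered as (1): Mxyz . ~Mxyx."""
    return M(x, y, z) and not M(x, y, x)

def A(x, y):
    """Alphabetic succession recovered as (2): Mxyx . ~Mxxx."""
    return M(x, y, x) and not M(x, x, x)
```

On this model `C("wx", "w", "x")` holds while `C("w", "w", "w")` fails, and `A("x", "w")` holds while `A(0, 0)` fails, the last because a non-expression always bears ‘M’ to itself.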

We see therefore that both ‘Cxyz’ and ‘xAy’ can be paraphrased 
in terms of ‘M’, viz. as (1) and (2). Accordingly y⌢z, which was
(ιx) Cxyz, becomes describable in terms of ‘M’ as

(ιx)(Mxyz . ~Mxyx);

and S₁, S₂, S₃, etc. become

(3) (ιx)((∃y)(Myxy . ~Myyy) . ~(∃y)(Mxyx . ~Mxxx)),

(4) (ιx)(MxS₁x . ~Mxxx),

(5) (ιx)(MxS₂x . ~Mxxx),

etc.

Actually the ‘~Mxxx’ in (4) is superfluous. The purpose of
‘~Mxxx’ in (2) was to rule out Case 3 and thus compel x to be an
expression; but this is already accomplished in (4) by the clause
‘MxS₁x’, since under Case 3 ‘MxS₁x’ would identify x with the sign
S₁ and thus compel x to be an expression after all. Thus (4) reduces
to ‘(ιx) MxS₁x’; and the analogous is true of (5) and its suite.

In place of (3), moreover, the simpler description: 

(6) (ιx)(y)(z) ~Mxyz

will serve for S₁. This is seen as follows. Where x is a complex
expression, it has parts y and z such that x consists of y followed
by z; therefore ‘(y)(z) ~Mxyz’ rules out Case 2. It also rules out
Case 3, since ‘~Mxxx’ was seen to do so. Therefore x must be a
sign if (y)(z) ~Mxyz. But then ‘(y)(z) ~Mxyz’ says, according to
Case 1, that the sign x is not an alphabetic successor; in other
words, that it is S₁.
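The descriptions of the signs in terms of ‘M’ can be checked by brute force over a small universe. The sketch below is an illustration only and rests on assumptions: the signs are modelled by the ASCII characters of `ALPHABET`, the universe is cut down to the signs, the two-sign expressions, and a single non-expression `"?"`, and the helper `the` plays the part of the description operator by returning the unique object meeting a condition (or `None` when there is none or more than one).

```python
# Assumed ASCII stand-ins for the nine signs, in alphabetic order.
ALPHABET = "wxyz'()|e"

def is_expression(x):
    return len(x) > 0 and all(c in ALPHABET for c in x)

def M(x, y, z):
    if is_expression(x) and len(x) == 1:      # Case 1
        return (is_expression(y) and len(y) == 1
                and ALPHABET.index(x) == ALPHABET.index(y) + 1)
    if is_expression(x):                      # Case 2
        return is_expression(y) and is_expression(z) and x == y + z
    return x == y                             # Case 3

# Finite universe: signs, two-sign expressions, and one non-expression.
UNIVERSE = list(ALPHABET) + [a + b for a in ALPHABET for b in ALPHABET] + ["?"]

def the(condition):
    """Stand-in for description: the unique member of UNIVERSE meeting the condition."""
    hits = [x for x in UNIVERSE if condition(x)]
    return hits[0] if len(hits) == 1 else None

def first_sign():
    # (6): the x such that (y)(z) ~Mxyz
    return the(lambda x: not any(M(x, y, z) for y in UNIVERSE for z in UNIVERSE))

def successor_sign(s):
    # Reduced form of (4) and (5): the x such that Mxsx
    return the(lambda x: M(x, s, x))

# Rebuild the whole alphabet from 'M' alone.
signs = [first_sign()]
while len(signs) < len(ALPHABET):
    signs.append(successor_sign(signs[-1]))
```

The restriction to two-sign expressions is harmless here, since every complex expression in the universe has its two parts in the universe as well.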

Our primitive syntactical notation now consists merely of ‘M’ 
and the notation of logic—ultimately ‘M’ and the notations of 
joint denial, quantification, and membership, since these latter are 
adequate to the logical matters. Just as the logical formulae
comprised atomic formulae of the form ⌜(α ε β)⌝ and all the expressions
thence constructible by joint denial and quantification, so
the syntactical formulae comprise atomic formulae of the forms
⌜(α ε β)⌝ and ⌜Mαβγ⌝ and all the expressions thence constructible
by joint denial and quantification.

When this syntactical notation is used for the discussion of expressions
composed of signs other than the particular nine with
which we have been identifying S₁, S₂, . . . , S₉, the meaning intended
for ‘Mxyz’ will of course undergo a corresponding change
in systematic fashion. The change takes place merely in the 
notion of alphabetic succession which is used under Case 1 in the 
above explanation; the relation of alphabetic succession varies with 
the alphabet. 

It should be remarked that ‘M’ and the abbreviations ‘S₁’,
etc. which are defined in terms of ‘M’ are not a practical substitute
for the former devices of quotation marks, Greek letters, and
corners. The old devices are both more graphic and more conducive
to brevity. And the new syntactical notation is not convenient
at all for stating definitions, where signs enter which were not
initially taken into account. The new notation has its value 
rather as a subject matter, when we come to talk about syntax 
rather than merely using syntax to talk about logic. As a medium 
of talking about this refined syntactical notation, in turn, the old 
practical devices of quotes, Greek letters, and corners recur; they 
have appeared to some extent in the present section and they will 
continue to appear in the sequel. 


§55. Protosyntax

GIVEN ANY expression, we can decide once and for all by a systematic
procedure of inspection whether or not it is a logical formula.
Similarly, as soon as all the desired non-logical forms of atomic 
formulae have been listed, we can decide by a systematic procedure 
whether or not a given expression is a formula at all, logical or 
otherwise. The systematic recognizability which thus attaches to 
the notions of logical formula and formula attaches in similar 
fashion to the notions of matrix, tautology, axiom of quantification, 
and axiom of membership (cf. §§ 16, 29). The notion of theorem 
lacks this feature, though retaining a partial sort of recognizability; 
appropriate to each formula which is a theorem there is a device 
(a so-called proof), discoverable in general only by luck, which once 
discovered enables us to see by a finite amount of inspection that 
the formula is a theorem (cf. § 16). Now any syntactical property 
which enjoys even this modest sort of recognizability is called 
constructive; thus the property of being a theorem, as well as the 
quite mechanically recognizable properties of being a formula, 
matrix, tautology, or axiom, are constructive. 

This account of constructivity, like the notion itself, is vague. 
In a sharp formulation it would be necessary to avoid such vague 
phrases as ‘mechanically recognizable’, ‘recognizable by a finite 
amount of inspection ’, etc.; but then there would be an essential 
difficulty in establishing any correspondence between this precisely 
defined notion and the intuitively intended notion which those 
vague phrases help describe. There is a striking feature, however, 
which the instances of constructivity cited above have in common: 
they are not only syntactically definable, but they are definable 
even in that narrower syntax which borrows only joint denial and 
quantification from logic and eschews membership. The syntactical
definition of ‘LFmla x’ suggested in (1) of §53 depends
indeed upon the whole of logic, membership included; but an
equivalent definition is possible, as we shall see (§ 56), which
involves only the ‘M’ of syntax plus the joint denial and quantification
of logic. The same is true of the other constructive notions
mentioned above (cf. §§ 56-58), and presumably equally true of all 
syntactical notions which are constructive in the vague sense above 
suggested. 

The part of syntax which omits membership will be called protosyntax.
Just as the logical formulae comprise atomic formulae of
the form ⌜(α ε β)⌝ and the expressions thence constructible by joint
denial and quantification, so the protosyntactical formulae comprise
atomic formulae of the form ⌜Mαβγ⌝ and all the expressions
thence constructible by joint denial and quantification. This
meager protosyntactical notation proves to constitute a sufficient
language for expressing all the metatheorems of foregoing chapters,
provided of course that we do away with the unspecified extra-logical
forms of atomic formulae or at least fix upon a definite list
of forms desired.

Protosyntactical definability is intended not as an approximation 
to constructivity, but as something more inclusive. The notion 
of non-theorem, e.g., is protosyntactically definable, yet presumably
not constructive; we know of nothing, discoverable even
by luck, which would enable us to decide that a given logical statement
φ is not a theorem. We may find that ⌜~φ⌝ is a theorem;
but this does not exclude the possibility that φ is also a theorem,
unless we assume the consistency for which we of course hope.
Even this channel is closed if it happens, as it well may, that
neither φ nor ⌜~φ⌝ is a theorem.

Three precise explications of the vague phrase ‘mechanically recognizable’ have 
been independently propounded in recent years, and the three have been proved 
equivalent to one another. One is the notion of so-called recursiveness (closely 
connected with the kind of procedure called recursive on page 86); this was first 
formulated in a relatively narrow sense by Gödel (“Unentscheidbare Sätze,”
pp. 179-180) and afterwards extended by Gödel and Herbrand (cf. Gödel, “Undecidable
Propositions”). The second is Church’s notion of λ-definability (“Unsolvable
Problem,” pp. 346ff, 356ff), and the third is Turing’s computability. The second was
proved equivalent to the first by Kleene, and to the third by Turing. A derivative
notion, recursive enumerability, corresponds to constructivity; see Kleene, “Recursive
Predicates.” But note that protosyntactical definability is broader.

In defining the notations ‘S₁’, ‘S₂’, . . . , ‘S₉’, and ‘⌢’ in terms
of ‘M’, in § 54, the notation ⌜(ια)φ⌝ of description was used. This
device was legitimate for purposes of syntactical definition, since it
reduces through D1-17 to the three logical primitives; but the
device is not immediately available for protosyntactical definition,
because its logical definition depends finally on ‘ε’, which is lacking
in protosyntax. Of the logical definitions presented in the foregoing
chapters, only D1-8 can be carried over into protosyntax;
for it is only these that do not presuppose membership.

We find, however, that the logical notion of description, and likewise
that of identity, admit of other definitions within the framework
of protosyntax. It will first be shown that the appropriate
sense of identity is obtainable by explaining ‘x = y’ as short for:

(1) (z)(Mzxx ≡ Mzyy).

Suppose, to begin with, that x is not an expression. Then Mxxx,
by Case 3 of § 54; hence, if (1) is true, it follows that Mxyy; but,
by Case 3 of § 54, this identifies x with y. We see therefore that
‘x = y’ in the intended sense follows from (1) when x is not an
expression. Next suppose that x is an expression, and let w be the
repetitive complex expression formed by writing x followed by x.
Then, by Case 2 of § 54, Mwxx; hence, if (1) is true, Mwyy; but,
by Case 2 of § 54, this makes y the first and last half of the repetitive
expression w, thus identifying y with x. Regardless of whether
x is an expression or not, therefore, ‘x = y’ in the intended sense
follows from (1). Conversely, moreover, (1) obviously follows from
‘x = y’. The following definition therefore serves our purpose:

A9. ⌜(α = β)⌝ for ⌜(γ)(Mγαα ≡ Mγββ)⌝.

The protosyntactical definitions will be distinguished thus from the
logical ones by using ‘A’ instead of ‘D’. A1-8 are understood as
D1-8.
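The argument for A9 can be spot-checked on a finite model. The following is a sketch only, under assumptions: signs are ASCII characters, the objects x and y are drawn from expressions of at most two signs over a three-sign fragment plus two non-expressions, and the quantified z ranges over expressions of at most four signs, which is enough to contain the repetitive expression formed from any such x. The names `identical`, `DOMAIN`, and `WITNESSES` are hypothetical.

```python
from itertools import product

ALPHABET = "wxyz'()|e"   # assumed ASCII stand-ins for the nine signs

def is_expression(x):
    return len(x) > 0 and all(c in ALPHABET for c in x)

def M(x, y, z):
    if is_expression(x) and len(x) == 1:      # Case 1
        return (is_expression(y) and len(y) == 1
                and ALPHABET.index(x) == ALPHABET.index(y) + 1)
    if is_expression(x):                      # Case 2
        return is_expression(y) and is_expression(z) and x == y + z
    return x == y                             # Case 3

def expressions(signs, max_len):
    """All rows of 1..max_len signs drawn from the given signs."""
    return ["".join(t) for n in range(1, max_len + 1)
            for t in product(signs, repeat=n)]

NON_EXPRESSIONS = ["?", "!"]
DOMAIN = expressions("wx'", 2) + NON_EXPRESSIONS      # values of x and y
WITNESSES = expressions("wx'", 4) + NON_EXPRESSIONS   # values of z

def identical(x, y):
    """A9 read off directly: (z)(Mzxx == Mzyy), with z restricted to WITNESSES."""
    return all(M(z, x, x) == M(z, y, y) for z in WITNESSES)
```

Within this model `identical(x, y)` agrees with ordinary equality for every pair drawn from `DOMAIN`; the distinguishing z is x followed by x when x is an expression, and x itself otherwise, just as in the text.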

Definition of description in protosyntax must proceed contextually,
like that of abstraction in logic (§§ 24, 26). First let us
deal with quasi-atomic contexts ⌜M (ια)φ β γ⌝ having a description
in the first place and variables in the other places. If w is the one
and only entity x such that . . . x . . . , i.e. if

(x)(x = w .≡ . . . x . . . )

(cf. § 27), then we want the quasi-atomic formula:

(2) M (ιx)(. . . x . . .) y z

to be true for any objects y and z if and only if Mwyz; if on the
other hand there is no such unique entity x, then we lose interest
and become content to have (2) turn out in any way that proves
convenient, say as false. All this is obviously accomplished by
explaining (2) as an abbreviation of:

(∃w)(Mwyz . (x)(x = w .≡ . . . x . . . )).

Thus

A10. ⌜M (ια)φ β γ⌝ for ⌜(∃δ)(Mδβγ . (α)(α = δ .≡ φ))⌝.

Let us understand terms, for purposes of protosyntax, as comprising
variables together with descriptions ⌜(ια)φ⌝ (where φ is a
formula of protosyntax). Let us use ‘ζ’, ‘η’, and ‘θ’ hereafter to
refer to terms in this sense. Then, repeating precisely the method
used in A10, we can define the further quasi-atomic contexts as
follows:

A11. ⌜M ζ (ια)φ γ⌝ for ⌜(∃δ)(Mζδγ . (α)(α = δ .≡ φ))⌝.

A12. ⌜M ζ η (ια)φ⌝ for ⌜(∃δ)(Mζηδ . (α)(α = δ .≡ φ))⌝.

Now ⌜Mζηθ⌝ is explained for all terms ζ, η, and θ, whether
variables or descriptions.

Let us next extend A9 to apply to terms in general, rather than
simply to variables.

A13. ⌜(ζ = η)⌝ for ⌜(γ)(Mγζζ ≡ Mγηη)⌝.

It is convenient also to import D11.

A14. ⌜(ζ ≠ η)⌝ for ⌜~(ζ = η)⌝.

The definitions of ‘S₁’, ‘S₂’, . . . , ‘S₉’, and ⌜ζ⌢η⌝ suggested in
§ 54 can now be set down.

A15. ‘S₁’ for ‘(ιx)(y)(z) ~Mxyz’.

A16. ‘S₂’ for ‘(ιx) Mx S₁ x’, ‘S₃’ for ‘(ιx) Mx S₂ x’, etc. to ‘S₉’.

A17. ⌜(ζ⌢η)⌝ for ⌜(ια)(Mαζη . ~Mαζα)⌝.

A definition analogous to D6 and D7 is desirable in connection
with the concatenation notation.

A18. ⌜(ζ₁⌢ζ₂⌢ζ₃)⌝ for ⌜((ζ₁⌢ζ₂)⌢ζ₃)⌝,

⌜(ζ₁⌢ζ₂⌢ζ₃⌢ζ₄)⌝ for ⌜((ζ₁⌢ζ₂⌢ζ₃)⌢ζ₄)⌝, etc.
Finally we carry over nine of the ten definitions suggested in
§ 53. In third place among these it is convenient to insert a further
definition introducing the notation ‘xPy’, in the sense ‘x is part
(or all) of y’.

A19. ⌜(ζ B η)⌝ for ⌜(ζ = η .∨ (∃α)(ζ⌢α = η))⌝.

A20. ⌜(ζ E η)⌝ for ⌜(ζ = η .∨ (∃α)(α⌢ζ = η))⌝.

A21. ⌜(ζ P η)⌝ for ⌜(∃α)(α B η . ζ E α)⌝.

A22. ⌜Ac ζ⌝ for ⌜(α)(α B ζ .⊃. S₅ E α)⌝.

A23. ⌜(ζ e η)⌝ for ⌜(S₆⌢ζ⌢S₉⌢η⌢S₇)⌝.

A24. ⌜(ζ j η)⌝ for ⌜(S₆⌢ζ⌢S₈⌢η⌢S₇)⌝.

A25. ⌜(ζ qu η)⌝ for ⌜(S₆⌢ζ⌢S₇⌢η)⌝.

A26. ⌜Vbl ζ⌝ for ⌜(∃α)(α = S₁ .∨. α = S₂ .∨. α = S₃ .∨. α = S₄ :
ζ = α .∨ (∃β)(Ac β . ζ = α⌢β))⌝.

A27. ⌜(ζ Q η)⌝ for ⌜(∃α)(Vbl α . ζ = α qu η)⌝.

A28. ⌜LFmla₀ ζ⌝ for ⌜(∃α)(∃β)(Vbl α . Vbl β . ζ = α e β)⌝.
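Read as predicates and operations on strings, the definitions A19 through A28 come to very little. The sketch below is one possible rendering, not a translation of the protosyntactical definitions themselves: it assumes the nine signs are the ASCII characters listed (with `|` for the joint-denial sign and `e` for epsilon), treats expressions as non-empty Python strings, and uses ordinary string operations where the definitions use ‘M’. The function names follow the abbreviations of the text; `ep` renders A23.

```python
# Assumed ASCII stand-ins for the signs S1..S9.
S1, S2, S3, S4, S5, S6, S7, S8, S9 = "w", "x", "y", "z", "'", "(", ")", "|", "e"

def B(x, y):            # A19: x begins y
    return len(x) > 0 and y.startswith(x)

def E(x, y):            # A20: x ends y
    return len(x) > 0 and y.endswith(x)

def P(x, y):            # A21: some initial segment of y ends in x
    return any(B(y[:k], y) and E(x, y[:k]) for k in range(1, len(y) + 1))

def Ac(x):              # A22: every initial segment of x ends in an accent
    return len(x) > 0 and all(E(S5, x[:k]) for k in range(1, len(x) + 1))

def ep(x, y):           # A23: the membership formula built from x and y
    return S6 + x + S9 + y + S7

def jd(x, y):           # A24: the joint denial of x and y
    return S6 + x + S8 + y + S7

def qu(x, y):           # A25: the quantification of y with respect to x
    return S6 + x + S7 + y

def Vbl(x):             # A26: one of 'w', 'x', 'y', 'z', with or without accents
    return x[:1] in (S1, S2, S3, S4) and (len(x) == 1 or Ac(x[1:]))

def Q(x, y):            # A27: x is a quantification of y
    head = x[:len(x) - len(y)]
    return (len(y) > 0 and x.endswith(y)
            and head[:1] == S6 and head[-1:] == S7 and Vbl(head[1:-1]))

def LFmla0(x):          # A28: x is an atomic logical formula
    if not (x[:1] == S6 and x[-1:] == S7):
        return False
    left, sign, right = x[1:-1].partition(S9)
    return sign == S9 and Vbl(left) and Vbl(right)
```

For instance `qu("x", ep("x", "y"))` yields the string `(x)(xey)`, which `Q` recognizes as a quantification of `(xey)`.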


§56. Formula and Matrix Defined

AS REMARKED (§ 55), the definition of ⌜LFmla ζ⌝ suggested in
(1) of §53 is not available to protosyntax; but a protosyntactical
definition is possible along other lines. The point of departure, in
devising this definition, is the following reflection: x is a logical
formula if and only if it belongs to a series of expressions x₁, x₂,
x₃, . . . , xₙ such that xᵢ, for each i, is either an atomic logical formula
or else a joint denial of earlier expressions of the series or else a
quantification of an earlier expression of the series.

Instead of speaking of a series of expressions x₁, x₂, . . . , xₙ, the
device suggests itself of speaking rather of a single long expression
composed of the successive expressions x₁, x₂, . . . , xₙ. But we
encounter a difficulty: analysis of the long expression does not
reveal uniquely the intended parts x₁, x₂, . . . , xₙ. Given the long
expression, we have no way of knowing how much of it was intended
as x₁, nor how much of the remainder was intended as x₂, and so on.
This difficulty, however, is easily overcome. Instead of taking our
long expression as x₁⌢x₂⌢ . . . ⌢xₙ, we can take it as

S₆⌢S₇⌢x₁⌢S₆⌢S₇⌢x₂⌢S₆⌢S₇⌢ . . . ⌢xₙ⌢S₆⌢S₇,

thus framing each of x₁, x₂, . . . , xₙ with occurrences of the arbitrary
expression S₆⌢S₇, i.e. ‘()’. Given this long expression, the parts
x₁, x₂, . . . , xₙ are uniquely determined as the segments reaching
from one occurrence of S₆⌢S₇ to the next; for we know that the
ungrammatical combination of signs S₆⌢S₇ will not have any additional
occurrences inside the formulae x₁, x₂, . . . , xₙ. Thus, if the
segments reaching from one occurrence of S₆⌢S₇ to the next in an
expression z be called the framed ingredients of z, we can explain a
logical formula as any framed ingredient of an expression z each of
whose framed ingredients is either an atomic logical formula or a
joint denial of previous framed ingredients or a quantification of a
previous framed ingredient. Such an expression z will be essentially
like the long expression considered above; it may depart
from that pattern only in a couple of inessential respects. Changing
S₆⌢S₇ to S₆⌢S₇⌢S₆⌢S₇, e.g., would make no difference so far as
framed ingredients are concerned; nor would annexation of further
matter at either end, so long as the added matter contained no
occurrences of S₆⌢S₇.

Let us make the above definitions explicit. To say that x is a
framed ingredient of z, symbolically x Ing z, is to say that S₆⌢S₇⌢x⌢
S₆⌢S₇ is part (or all) of z and S₆⌢S₇ is not part of x.

A29. ⌜(ζ Ing η)⌝ for ⌜(S₆⌢S₇⌢ζ⌢S₆⌢S₇ P η . ~(S₆⌢S₇ P ζ))⌝.

One framed ingredient x of z is prior to another one y, symbolically
x Pr_z y, if x is a framed ingredient of an initial segment of z whereof
y is not a framed ingredient.

A30. ⌜(ζ Pr_η θ)⌝ for ⌜(∃α)(α B η . ζ Ing α . θ Ing η . ~(θ Ing α))⌝.

The formal definition of logical formula, then, is as follows.

(1) ⌜LFmla ζ⌝ for ⌜(∃α)(ζ Ing α . (β)(β Ing α .⊃. LFmla₀ β .∨
(∃γ)(∃δ)(γ Pr_α β . δ Pr_α β : β = γ j δ .∨. β Q γ)))⌝.
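The framing device lends itself to a direct check. In the sketch below the existential quantifier of (1) is not searched for; instead the long expression z is supplied by the caller, and the code merely verifies that it does the work required of it. Everything here is an assumption made for illustration: signs are ASCII characters (`|` for joint denial, `e` for epsilon), the helper names (`frame`, `framed_ingredients`, `LFmla_witnessed`) are hypothetical, and `Pr(x, z, y)` is written with the subscript z as its middle argument.

```python
FRAME = "()"   # the ungrammatical combination S6 S7

def P(x, y):
    return len(x) > 0 and x in y

def Ing(x, z):
    """A29: x is a framed ingredient of z."""
    return len(x) > 0 and P(FRAME + x + FRAME, z) and not P(FRAME, x)

def Pr(x, z, y):
    """A30: x is prior to y among the framed ingredients of z."""
    return Ing(y, z) and any(Ing(x, z[:k]) and not Ing(y, z[:k])
                             for k in range(1, len(z) + 1))

def framed_ingredients(z):
    """Framed ingredients of z in order of first appearance."""
    found = []
    for piece in z.split(FRAME)[1:-1]:
        if piece and piece not in found:
            found.append(piece)
    return found

def is_variable(x):
    return len(x) > 0 and x[0] in "wxyz" and set(x[1:]) <= {"'"}

def LFmla0(x):
    if not (x.startswith("(") and x.endswith(")")):
        return False
    left, sign, right = x[1:-1].partition("e")
    return sign == "e" and is_variable(left) and is_variable(right)

def jd(x, y):
    return "(" + x + "|" + y + ")"

def Q(x, y):
    head = x[:len(x) - len(y)]
    return (len(y) > 0 and x.endswith(y) and head.startswith("(")
            and head.endswith(")") and is_variable(head[1:-1]))

def LFmla_witnessed(x, z):
    """Definition (1), with the long expression z supplied rather than sought."""
    if not Ing(x, z):
        return False
    ingredients = framed_ingredients(z)
    for b in ingredients:
        prior = [g for g in ingredients if Pr(g, z, b)]
        if not (LFmla0(b)
                or any(b == jd(g, d) for g in prior for d in prior)
                or any(Q(b, g) for g in prior)):
            return False
    return True

def frame(series):
    """Build the long expression from a series of expressions."""
    return FRAME + FRAME.join(series) + FRAME
```

Given the series `["(xey)", "(yex)", "((xey)|(yex))", "(x)((xey)|(yex))"]`, the expression `frame(series)` certifies each of its four members as a logical formula, whereas a frame containing the joint denial alone certifies nothing, its components not being prior ingredients.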

Just as A28 provides for atomic formulae of the form ⌜(α ε β)⌝,
other analogous protosyntactical definitions will provide for each
further form of atomic formula once these further forms have been
chosen. In preparation for such definitions we would presumably
so reconstrue ⌜Mαβγ⌝ as to allow a longer alphabet, running beyond
S₉ to include such supplementary signs as may be wanted in the
non-logical atomic formulae (cf. §54); but even this departure is
not strictly necessary, for we could simply use the novel combinations
V’, V", etc. in lieu of such supplementary signs and
thus keep within our nine-sign alphabet. If we bring together the
definitions appropriate to the several atomic forms, then, by alternation,
we obtain a general definition of ⌜Fmla₀ ζ⌝. On this
basis, in turn, we can define the general notion of formula in exact
analogy to the definition (1) of logical formula.

A31. ⌜Fmla ζ⌝ for ⌜(∃α)(ζ Ing α . (β)(β Ing α .⊃. Fmla₀ β .∨
(∃γ)(∃δ)(γ Pr_α β . δ Pr_α β : β = γ j δ .∨. β Q γ)))⌝.

Definitions will hereafter be set up in the manner of A31 rather
than (1), under the fictive assumption that a general definition of
⌜Fmla₀ ζ⌝ is at hand in place of A28. Just as A31 then defines the
general notion of formula, logical and otherwise, so in subsequent
definitions the notions of matrix, tautology, and theorem are
defined in their full generality. If in particular the atomic formulae
are construed simply as the logical atomic formulae, so that
⌜Fmla₀ ζ⌝ comes to be defined by A28 itself, then A31 ceases to
differ at all from (1); and correspondingly for the subsequent
definitions.

Definition of the notion of matrix presupposes the notions of 
bondage and freedom, basic to which is the notion of bound occurrence.
We have first of all to ask, what is an occurrence? what
kind of thing e.g. is the third occurrence of an expression x in an
expression y, as distinct from the mere expression or typographic
shape x itself? An answer, artificial but convenient and adequate,
is this: an occurrence of x in y is an initial segment of y ending in
x. The third “occurrence” of x in y, e.g., is construed as that
initial segment of y which remains when (as we should ordinarily
say) everything after the third occurrence of x is lopped off. A
“later occurrence” of x differs from an “earlier” one in being a
longer initial segment of y.

Thus z may be said to be an occurrence of x in y, symbolically
z O_y x, if zBy and xEz. But it is better to refine the definition in
such a way as to except accented occurrence; in the context
‘z ε w''’, e.g., it is desirable to recognize an occurrence of ‘w''’ but
to deny any occurrence of the variables ‘w'’ and ‘w’. This refinement
is accomplished by supplementing ‘zBy’ and ‘xEz’ with the
further clause ‘~(z⌢S₅ B y)’.

A32. ⌜(ζ O_θ η)⌝ for ⌜(ζ B θ . ~(ζ⌢S₅ B θ) . η E ζ)⌝.

To say that x occurs in y, symbolically x In y, is to say that there
is an occurrence of x in y.

A33. ⌜(ζ In η)⌝ for ⌜(∃α)(α O_η ζ)⌝.

Obviously ‘x In y’ goes beyond ‘xPy’ only to the extent of
excepting accented occurrence.
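The construal of an occurrence as an initial segment is easily tried out on strings. The following is a minimal sketch, assuming ASCII stand-ins for the signs (with `e` for epsilon and `'` for the accent); the function names `occurrences` and `occurs_in` are illustrative.

```python
ACCENT = "'"

def occurrences(x, y):
    """Occurrences of x in y in the refined sense: initial segments of y
    that end in x and are not immediately followed in y by an accent."""
    return [y[:k] for k in range(1, len(y) + 1)
            if len(x) > 0
            and y[:k].endswith(x)
            and not y.startswith(y[:k] + ACCENT)]

def occurs_in(x, y):
    """x In y: there is at least one occurrence of x in y."""
    return len(occurrences(x, y)) > 0
```

In the context `(zew'')` this recognizes one occurrence of `w''` and none of `w'` or `w`; and in `((xey)|(xex))` the third occurrence of `x` is the initial segment `((xey)|(xex`, each later occurrence being a longer initial segment than the earlier ones.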

An occurrence of a variable is bound in a given formula when it 
falls within a formula which forms part (or all) of the given formula 
and begins with a quantifier containing the same variable. But 
now that an “occurrence” is construed as an initial segment, the 
formulation of bondage assumes this form: z is a bound occurrence
of x in y, symbolically z BO_y x, if x is a variable and z is an
occurrence of x in y and there is a formula x qu w which begins
somewhere in the segment z and ends at or after the end of z
(but still in y). More rigorously,

z BO_y x .≡. Vbl x . z O_y x . (∃w)(∃z')(∃x')(Fmla w .
z = z'⌢x . z'⌢x' O_y x qu w . ~(x qu w E x')).

Thus

A34. ⌜(ζ BO_θ η)⌝ for ⌜(Vbl η . ζ O_θ η . (∃α)(∃β)(∃γ)(Fmla α .
ζ = β⌢η . β⌢γ O_θ η qu α . ~(η qu α E γ)))⌝.

A free occurrence is one which is not bound.

A35. ⌜(ζ FO_θ η)⌝ for ⌜(Vbl η . ζ O_θ η . ~(ζ BO_θ η))⌝.

A variable x is free in y, symbolically xFy, if it has a free occurrence
in y.

A36. ⌜(ζ F η)⌝ for ⌜(∃α)(α FO_η ζ)⌝.

Matrices, finally, are formulae with free variables, and statements
are other formulae.

A37. ⌜Mat ζ⌝ for ⌜(Fmla ζ . (∃α)(α F ζ))⌝.

A38. ⌜Stat ζ⌝ for ⌜(Fmla ζ . ~Mat ζ)⌝.


§57. Axioms of Quantification Defined

WE TURN next to the problem of defining tautology protosyntactically.
As an auxiliary we need the notion of truth-functional
component (cf. § 10). An immediate truth-functional component
of x is anything y such that x is y j z or z j y for some formula z; and
w is a truth-functional component of x, symbolically w TC x, if w
belongs to a sequence of expressions each of which is either x or an
immediate truth-functional component of an earlier expression of
the sequence. The notion of sequence involved here can be dealt
with by the same device which was used in defining formula
(§ 56). Thus

A39. ⌜(ζ TC η)⌝ for ⌜(∃α)(ζ Ing α . (β)(β Ing α .⊃: β = η .∨
(∃γ)(∃δ)(γ Pr_α β . Fmla δ : γ = β j δ .∨. γ = δ j β)))⌝.

As a further auxiliary in defining tautology we need the notion of
truth set (cf. § 10); or, since this would involve membership and
thus exceed the bounds of protosyntax, we may avail ourselves
rather of a parallel notion of truth sequence. A truth sequence of x
is any expression whose framed ingredients constitute a class S
fulfilling (I) of page 53. Thus to say that y is a truth sequence of
x, symbolically y TS x, is to say that any joint denial z j w which is a
truth-functional component of x is a framed ingredient of y if and
only if neither z nor w is a framed ingredient of y.

A40. ⌜(ζ TS η)⌝ for ⌜(α)(β)(Fmla α . α j β TC η .⊃:.
α j β Ing ζ .≡: α Ing ζ .↓. β Ing ζ)⌝.

To say that a formula x is tautologous, finally, symbolically
Taut x, is to say that x belongs to each of its truth sets (cf. § 10);
in other words, that x is a framed ingredient of each of its truth
sequences.

A41. ⌜Taut ζ⌝ for ⌜(Fmla ζ . (α)(α TS ζ .⊃. ζ Ing α))⌝.
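The test of tautology can be illustrated computationally. The sketch below does not construct truth sequences as framed expressions; it swaps in the equivalent familiar device of assigning truth values freely to those truth-functional components which are not joint denials, and computing the rest by the rule that a joint denial is true if and only if neither component is. This substitution, the ASCII stand-ins for the signs (`|` for joint denial, `e` for epsilon), the recursive recognizer `is_formula`, and all function names are assumptions of the illustration.

```python
from itertools import product
import re

ATOM = re.compile(r"\([wxyz]'*e[wxyz]'*\)")      # atomic logical formula
QUANTIFIER = re.compile(r"\([wxyz]'*\)")         # quantifier prefix

def jd(y, z):
    return "(" + y + "|" + z + ")"

def split_joint_denial(x):
    """Return (y, z) when x is the joint denial of formulae y and z, else None."""
    if x.startswith("(") and x.endswith(")"):
        for i, c in enumerate(x):
            if c == "|" and is_formula(x[1:i]) and is_formula(x[i + 1:-1]):
                return x[1:i], x[i + 1:-1]
    return None

def is_formula(x):
    if ATOM.fullmatch(x):
        return True
    m = QUANTIFIER.match(x)
    if m and is_formula(x[m.end():]):
        return True
    return split_joint_denial(x) is not None

def components(x):
    """Truth-functional components of x, x itself included."""
    parts = split_joint_denial(x)
    if parts is None:
        return {x}
    return {x} | components(parts[0]) | components(parts[1])

def tautologous(x):
    if not is_formula(x):
        return False
    comps = sorted(components(x), key=len)       # components before compounds
    basic = [c for c in comps if split_joint_denial(c) is None]
    for values in product([False, True], repeat=len(basic)):
        truth = dict(zip(basic, values))
        for c in comps:
            parts = split_joint_denial(c)
            if parts is not None:
                truth[c] = not (truth[parts[0]] or truth[parts[1]])
        if not truth[x]:
            return False
    return True
```

Each assignment corresponds to what a truth set of x decides about the components of x, so a formula passes this test just in case it comes out true under every such assignment.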

In defining tautology we have made some progress towards 
definition of the axioms of quantification. For the latter purpose a 
second necessary step is definition of the notion of closure; and as 
a preliminary to this, in turn, we need the notion of alphabetic
succession of variables. The alphabet of variables is the infinite
alphabet:

w, x, y, z, w', x', y', z', w'', . . .;

it must not be confused with the finite alphabet of signs:

w, x, y, z, ', (, ), ↓, ε.

Thus the alphabetic successor of a variable x, symbolically vs x,
begins with S₂ or S₃ or S₄ or S₁⌢S₅ according as x begins with S₁ or
S₂ or S₃ or S₄; and the rest of vs x, if any, is like the rest of x.

A42. ⌜vs ζ⌝ for ⌜(ια)(∃β)(∃γ)(β = S₁ . γ = S₂ .∨. β = S₂ . γ = S₃ .∨.
β = S₃ . γ = S₄ .∨. β = S₄ . γ = S₁⌢S₅ :
ζ = β . α = γ .∨ (∃δ)(ζ = β⌢δ . α = γ⌢δ))⌝.

A variable x is alphabetically prior to a variable y, symbolically
x VPr y, if y is vs x or vs vs x or vs vs vs x or etc.; or, equivalently,
if vs x or vs vs x or vs vs vs x or vs vs vs vs x is an initial segment
of y.

A43. ⌜(ζ VPr η)⌝ for ⌜(Vbl ζ . Vbl η : vs ζ B η .∨. vs vs ζ B η .∨.
vs vs vs ζ B η .∨. vs vs vs vs ζ B η)⌝.

If we had dispensed with ‘x’, ‘y’, and ‘z’ and limited our variables
to ‘w’ with and without accents, A42 and A43 would have been
greatly simplified; the successor of a variable would be the same
variable accented, and one variable would be prior to another if
initial to it.
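That four applications of the successor suffice in the definition of alphabetic priority can be confirmed by comparison with the obvious numbering of the variables. The sketch below assumes variables are Python strings consisting of one of the letters `w`, `x`, `y`, `z` followed by zero or more ASCII accents; the helper `position`, which numbers the infinite alphabet, is introduced only for the comparison and is not part of the definitions.

```python
LETTERS = "wxyz"

def is_variable(x):
    return len(x) > 0 and x[0] in LETTERS and set(x[1:]) <= {"'"}

def vs(x):
    """Alphabetic successor of the variable x: the first letter moves on
    (z wrapping round to w with an extra accent); the rest is unchanged."""
    head = {"w": "x", "x": "y", "y": "z", "z": "w'"}[x[0]]
    return head + x[1:]

def VPr(x, y):
    """x is alphabetically prior to y: one of the first four successors
    of x is an initial segment of y."""
    if not (is_variable(x) and is_variable(y)):
        return False
    candidate = x
    for _ in range(4):
        candidate = vs(candidate)
        if y.startswith(candidate):
            return True
    return False

def position(x):
    """Place of x in the alphabet w, x, y, z, w', x', ... (comparison helper)."""
    return 4 * (len(x) - 1) + LETTERS.index(x[0])
```

Four successors are enough because among them each of the four letters appears exactly once, carrying either the accents of x or one more; any later variable merely adds accents to one of these.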

As a next step towards the notion of closure we must define
‘x 1F y’ in the sense ‘x is alphabetically the first free variable of y’.

A44. ⌜(ζ 1F η)⌝ for ⌜(ζ F η . ~(∃α)(α F η . α VPr ζ))⌝.

Now the closure of x, symbolically cl x, is readily defined in conformity
with § 14.

A45. ⌜cl ζ⌝ for ⌜(ια)(Fmla α . ζ E α .
(β)(Fmla β . ζ E β . β E α .⊃ (γ)(Vbl γ . γ qu β E α .≡. γ 1F β)))⌝.

We can now define the axioms of quantification of kind A (cf.
§ 15). To say that x is one of these, symbolically AQ_A x, is to say
that x is the closure of a tautologous formula.

A46. ⌜AQ_A ζ⌝ for ⌜(∃α)(Taut α . ζ = cl α)⌝.

Preparatory to defining the other kinds of axioms of quantification,
it is convenient to adopt the following definitions.

A47. ⌜n ζ⌝ for ⌜(ζ j ζ)⌝.

A48. ⌜(ζ al η)⌝ for ⌜n(ζ j η)⌝.

A49. ⌜(ζ cd η)⌝ for ⌜(n ζ al η)⌝.

Comparison of A47 with D1 shows that, where x is a formula, nx is
the denial of x (in primitive notation, of course). Comparison of
A48 with D3 shows similarly that x al y is the alternation formed
from x and y in that order; and comparison of A49 with D4 shows
that x cd y is the conditional whose antecedent is x and whose consequent
is y.
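The three operations are simple enough to write out as string builders. In the sketch below the joint-denial sign is represented by the ASCII character `|` and formulae by Python strings, both assumptions of the illustration; the Boolean function `nor` is added only to confirm that the compounds have the truth conditions claimed for them.

```python
def jd(x, y):
    """The joint denial of x and y, written out in primitive notation."""
    return "(" + x + "|" + y + ")"

def n(x):
    """Denial of x: the joint denial of x with itself."""
    return jd(x, x)

def al(x, y):
    """Alternation of x and y: the denial of their joint denial."""
    return n(jd(x, y))

def cd(x, y):
    """Conditional with antecedent x and consequent y."""
    return al(n(x), y)

def nor(a, b):
    """Truth function of joint denial, used only to check the compounds."""
    return not (a or b)
```

With `p` as `(xey)` and `q` as `(yez)`, `al(p, q)` is `(((xey)|(yez))|((xey)|(yez)))`, which makes plain how quickly primitive notation grows.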

Now the kinds B and C of axioms of quantification are readily 
defined. 

A50. ⌜AQ_B ζ⌝ for ⌜(∃α)(∃β)(∃γ)(Vbl α . Fmla β . Fmla γ .
ζ = cl((α qu (β cd γ)) cd ((α qu β) cd (α qu γ))))⌝.

A51. ⌜AQ_C ζ⌝ for ⌜(∃α)(∃β)(Vbl α . Fmla β . ~(α F β) .
ζ = cl(β cd (α qu β)))⌝.

Let us write ‘x' SF^{y'}_{y} x’ to mean that x' is like x except for containing
free occurrences of y' wherever x contains free occurrences
of y. (The ‘S’ and ‘F’ here are to suggest the words ‘substitution’
and ‘free’.) This must be defined before we can define the quantificational
axioms of kind D. Preparatory to defining ‘x' SF^{y'}_{y} x’ it
is convenient to define ‘x' SF1^{y'}_{y} x’, in the sense ‘x' is like x except
for containing a free occurrence of y' in place of one free occurrence
of y’.

A52. ⌜(ζ' SF1^{η'}_{η} ζ)⌝ for ⌜(∃α)(∃α')(α FO_ζ η . α' FO_ζ' η' .
(β)(α = β⌢η .≡. α' = β⌢η' : ζ = α⌢β .≡. ζ' = α'⌢β))⌝.

Next let us write ‘x' SF≥^{y'}_{y} x’ to mean that x' is like x except for
containing free occurrences of y' at ≥0 places where x contains
free occurrences of y; in other words, that x' belongs to a sequence
of expressions each of which is either x or else like an earlier expression
of the sequence except for containing a free occurrence of
y' in place of a free occurrence of y. The notion of sequence involved
here can be dealt with as in A31 and A39.





A53. ⌜(ζ' SF≥^{η'}_{η} ζ)⌝ for ⌜(∃α)(ζ' Ing α . (β)(β Ing α .⊃: β = ζ .∨
(∃γ)(γ Pr_α β . β SF1^{η'}_{η} γ)))⌝.

Finally ‘x' SF^{y'}_{y} x’ says that x' SF≥^{y'}_{y} x and further that no free
occurrences of y are left standing in x', unless of course y and y'
are the same variable.

A54. ⌜(ζ' SF^{η'}_{η} ζ)⌝ for ⌜(ζ' SF≥^{η'}_{η} ζ : ~(η F ζ') .∨. η = η')⌝.

Definition of the axioms of quantification of kind D is now easy. 

A55. ⌜AQ_D ζ⌝ for ⌜(∃α)(∃α')(∃β)(∃β')(Vbl α . Vbl α' . Fmla β .
β' SF^{α'}_{α} β . ζ = cl((α qu β) cd β'))⌝.

§58. Theorem Defined 

PREPARATORY to defining the axioms of membership, we need 
the notion of stratification. The earlier explanation of stratification 
(§ 28) involved putting numerals for variables; by way of keeping 
within our protosyntactical resources, however, let us now use 
strings of arrows instead of numerals — n arrows instead of the 
nth positive numeral. Sacrifice of ‘0’ and the negative numerals
is obviously of no consequence; stratification established with help 
of non-positive numerals can be established equally well by use of 
uniformly higher numerals all of which are positive. 

First we adopt three definitions closely analogous to A52-54. 


A56. 

r (r , si?'*r 

for 

r (3«)(3a')(a 

Of r). a Of/ 1)'. 


(/3) (a = 

0 t] . = . a' = $ 

V : r = a0 afi)y. 

A57. 

r (f'S?U) n 

for 

r (3a)(f' Ing 

a ■ 08)03 Ing a -D : 0 = f .V 





(3t)(t Pr a 0.0 Sl^ t))) 1 . 

A58. 

r (r s ur 

for 


(V In f')) n - 


Where y and y' are any expressions, clearly ‘x' Si* x’ says that x' 
is like x except for containing y' in place of an occurrence of y. So 
long as x, y, and y' are not such that substitution of y' for y in x 
gives rise to new occurrences of y, it is clear further that ‘ x' S ?* x ' 
holds if and only if x' is like x except for containing y' in place of 
≥ 0 occurrences of y, and that ‘$x'\ \mathrm{S}^{y'}_{y}\ x$’ holds if and only if x' is
like x except for containing y' in place of all occurrences of y. In
particular these versions are correct in those cases where y is a 
variable and y' a string of arrows, since clearly no new occurrence 
of a variable y can be generated by putting a string of arrows for 
y in any context. 

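‘Ar x’ will mean that x is a string of arrows. The definition
resembles A22.

A59. $\ulcorner \mathrm{Ar}\ \zeta \urcorner$ for $\ulcorner (\alpha)(\alpha\ \mathrm{B}\ \zeta \,.\supset.\, \mathrm{S}_8\ \mathrm{E}\ \alpha) \urcorner$.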

Next let us speak of x as an arrow projection of y, symbolically 
xAPy, when x is formed from y by putting strings of arrows for all 
occurrences of ≥ 0 variables (like strings for like variables); in
other words, when x is one of a sequence of expressions each of 
which either is y or else is like an earlier expression of the sequence 
except for containing some string of arrows in place of all occur¬ 
rences of some variable. Thus, using our usual sequence technique, 

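A60. $\ulcorner (\zeta\ \mathrm{AP}\ \eta) \urcorner$ for $\ulcorner (\exists\alpha)(\zeta\ \mathrm{Ing}\ \alpha \,.\, (\beta)(\beta\ \mathrm{Ing}\ \alpha \,.\supset:\, \beta = \eta \,.\lor\, (\exists\gamma)(\exists\delta)(\exists\beta')(\mathrm{Ar}\ \gamma \,.\, \mathrm{Vbl}\ \delta \,.\, \beta'\ \mathrm{Pr}_{\alpha}\ \beta \,.\, \beta\ \mathrm{S}^{\gamma}_{\delta}\ \beta'))) \urcorner$.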

Finally, a formula x is stratified if it has an arrow projection 
wherein (i) no variables are left unreplaced by strings of arrows 
and (ii) a string of arrows following ‘ε’ is always longer by one
than the string preceding. 

A61. r Strat for r (Fmla f . (3<*)(aAP f . ~(30)(Vbl 0.0 Pa). 

( 7 )(S)(Ar y . Ar 5 . -yeS Pa ,3. 5= y Ss))) 1 . 
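The stratification test just described can be imitated outside protosyntax. The following Python sketch is an illustration only and not the definition A61: it assumes that a formula has already been reduced to its membership clauses, each given as a hypothetical pair (left, right) standing for ‘left ε right’, and it looks for relative integer levels in place of strings of arrows. As remarked above, raising all levels uniformly makes them positive, so relative levels suffice.

```python
from collections import defaultdict, deque


def is_stratified(memberships):
    """Return True if levels can be assigned so that, in every clause
    (left, right), the level of right exceeds the level of left by one.

    `memberships` is an assumed input format: a list of variable-name pairs.
    """
    # Each clause yields two constraints: right = left + 1 and left = right - 1.
    graph = defaultdict(list)
    for left, right in memberships:
        graph[left].append((right, 1))
        graph[right].append((left, -1))

    level = {}
    for start in list(graph):
        if start in level:
            continue
        level[start] = 0  # relative level; only differences matter
        queue = deque([start])
        while queue:
            current = queue.popleft()
            for neighbour, offset in graph[current]:
                wanted = level[current] + offset
                if neighbour not in level:
                    level[neighbour] = wanted
                    queue.append(neighbour)
                elif level[neighbour] != wanted:
                    return False  # two clauses demand conflicting levels
    return True
```

For example, the clauses of ‘x ε y . y ε z’ pass the test, while the single clause of ‘x ε x’ fails it, since no string of arrows is longer by one than itself.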

The following definitions, needed as further preparation for the 
axioms of membership, are related to D2, D5, and D8 as A47-49 
were related to Dl, D3, and D4. 


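A62. $\ulcorner (\zeta\ \mathrm{cj}\ \eta) \urcorner$ for $\ulcorner (\mathrm{n}\zeta\ \mathrm{j}\ \mathrm{n}\eta) \urcorner$.

A63. $\ulcorner (\zeta\ \mathrm{b}\ \eta) \urcorner$ for $\ulcorner ((\zeta\ \mathrm{cd}\ \eta)\ \mathrm{cj}\ (\eta\ \mathrm{cd}\ \zeta)) \urcorner$.

A64. $\ulcorner (\zeta\ \mathrm{qe}\ \eta) \urcorner$ for $\ulcorner \mathrm{n}(\zeta\ \mathrm{qu}\ \mathrm{n}\eta) \urcorner$.

Thus, where x and y are formulae, x cj y is their conjunction and
x b y is their biconditional (expanded into primitive notation);
and, where z is a variable, z qe x is the existential quantification of x
with respect to z.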

Similarly, where x and y are variables, their identity will be
called x id y. More accurately, x id y is to be the formula (in primitive
notation) whose abbreviation via D1-10 is formed by putting x
and y in the respective blanks of ‘(   =   )’. The odd bound variable
α of D10 may conveniently be understood always as the alphabetically
earliest variable other than x and y (hence the alphabetically
earliest variable foreign to x⌢y) in conformity with the plan suggested
earlier (§ 24). In general, let us speak of the alphabetically
earliest variable foreign to z as fz.

A65. $\ulcorner \mathrm{f}\zeta \urcorner$ for $\ulcorner (\iota\alpha)({\sim}(\alpha\ \mathrm{In}\ \zeta) \,.\, \mathrm{Vbl}\ \alpha \,.\, (\beta)(\beta\ \mathrm{VPr}\ \alpha \,.\supset.\, \beta\ \mathrm{In}\ \zeta)) \urcorner$.

Then, in view of D10,

A66. $\ulcorner (\zeta\ \mathrm{id}\ \eta) \urcorner$ for $\ulcorner (\mathrm{f}(\zeta^\frown\eta)\ \mathrm{qu}\ ((\mathrm{f}(\zeta^\frown\eta)\ \mathrm{e}\ \zeta)\ \mathrm{b}\ (\mathrm{f}(\zeta^\frown\eta)\ \mathrm{e}\ \eta))) \urcorner$.

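When x and y are variables and z is a formula, x e yaz is to be
the formula (in primitive notation) whose abbreviation via D1-9
is formed by putting x in the first blank of ‘(   ε ^   )’, z in the
last, and y under the circumflex. The definition is apparent from
D9. The odd variable γ of D9 is taken as the alphabetically
earliest variable other than x, y, and the variables of z, hence as
f(x⌢y⌢z).

A67. $\ulcorner (\zeta\ \mathrm{e}\ \eta\mathrm{a}\theta) \urcorner$ for $\ulcorner (\mathrm{f}(\zeta^\frown\eta^\frown\theta)\ \mathrm{qe}\ ((\zeta\ \mathrm{e}\ \mathrm{f}(\zeta^\frown\eta^\frown\theta))\ \mathrm{cj}\ (\eta\ \mathrm{qu}\ ((\eta\ \mathrm{e}\ \mathrm{f}(\zeta^\frown\eta^\frown\theta))\ \mathrm{cd}\ \theta)))) \urcorner$.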

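Where x, y, and z are as before, x id yaz is to be the formula
whose abbreviation via D1-10 is formed by putting x in the first
blank of ‘(   = ^   )’, z in the last, and y under the circumflex.

Thus, in view of D10,

A68. $\ulcorner (\zeta\ \mathrm{id}\ \eta\mathrm{a}\theta) \urcorner$ for $\ulcorner (\mathrm{f}(\zeta^\frown\eta^\frown\theta)\ \mathrm{qu}\ ((\mathrm{f}(\zeta^\frown\eta^\frown\theta)\ \mathrm{e}\ \zeta)\ \mathrm{b}\ (\mathrm{f}(\zeta^\frown\eta^\frown\theta)\ \mathrm{e}\ \eta\mathrm{a}\theta))) \urcorner$.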

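Where x and y are variables and z and w are formulae, xaz e yaw
is to be the formula whose abbreviation via D1-12 is formed by
putting z and w in the respective blanks and x and y under the
respective circumflexes of ‘(^   ε ^   )’. Thus, in view of D12,

A69. $\ulcorner (\zeta\mathrm{a}\eta\ \mathrm{e}\ \zeta'\mathrm{a}\eta') \urcorner$ for $\ulcorner (\mathrm{f}(\zeta^\frown\eta^\frown\zeta'^\frown\eta')\ \mathrm{qe}\ ((\mathrm{f}(\zeta^\frown\eta^\frown\zeta'^\frown\eta')\ \mathrm{id}\ \zeta\mathrm{a}\eta)\ \mathrm{cj}\ (\mathrm{f}(\zeta^\frown\eta^\frown\zeta'^\frown\eta')\ \mathrm{e}\ \zeta'\mathrm{a}\eta'))) \urcorner$.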

Where x is a variable, x e v is to be the formula whose abbreviation
via D1-15 is formed by putting x in the blank of ‘(   ε V)’;
and, where y is a formula, xay e v is to be the formula whose
abbreviation is formed by putting x under the circumflex and y in
the blank of ‘(^   ε V)’. Thus, in view of D15,

A70. $\ulcorner (\zeta\ \mathrm{e}\ \mathrm{v}) \urcorner$ for $\ulcorner (\zeta\ \mathrm{e}\ \mathrm{S}_2\mathrm{a}(\mathrm{S}_2\ \mathrm{id}\ \mathrm{S}_2)) \urcorner$.
A71. $\ulcorner (\zeta\mathrm{a}\eta\ \mathrm{e}\ \mathrm{v}) \urcorner$ for $\ulcorner (\zeta\mathrm{a}\eta\ \mathrm{e}\ \mathrm{S}_2\mathrm{a}(\mathrm{S}_2\ \mathrm{id}\ \mathrm{S}_2)) \urcorner$.

Let us write ‘R x’ to mean that every quantification which x contains
is restricted to elements (except for occurrences of quantifications
within the definitional expansions of the restricting clauses
themselves).

A 72. r Rf n for r (a)(0)(7)(Vbl a . Fmla /3.7 O f aqu/3 .3 

(35)(35')(|8 = (aev) cd 5 .v. Vbl 5 . y~5' O f 5ev . ~ 5ev P 5')) 1 . 

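Finally let us write ‘$\mathrm{AM_A}\ x$’ to mean that x is an axiom of membership
of the first kind, i.e., an axiom of elementhood. For simplicity
of definition I shall construe these axioms somewhat more
broadly now than in § 28; but the reader can easily satisfy himself
that this enlargement of the class of axioms of elementhood is
inessential, in the sense of not enlarging the class of theorems.

A73. $\ulcorner \mathrm{AM_A}\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\exists\beta)(\exists\gamma)(\mathrm{Fmla}\ \alpha \,.\, \mathrm{Vbl}\ \beta \,.\, \mathrm{Strat}\ \gamma \,.\, \mathrm{R}\ \gamma \,.\, (\delta)(\delta\ \mathrm{F}\ \gamma \,.\, \delta \neq \beta \,.\supset\, \mathrm{Taut}\ \alpha\ \mathrm{cd}\ (\delta\ \mathrm{e}\ \mathrm{v})) \,.\, \zeta = \mathrm{cl}(\alpha\ \mathrm{cd}\ (\beta\mathrm{a}\gamma\ \mathrm{e}\ \mathrm{v}))) \urcorner$.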

The other two kinds of axioms of membership, described in § 29, 
present no difficulty. 

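A74. $\ulcorner \mathrm{AM_B}\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\exists\alpha')(\exists\beta)(\exists\beta')(\mathrm{Vbl}\ \alpha \,.\, \mathrm{Vbl}\ \alpha' \,.\, \mathrm{Fmla}_0\ \beta \,.\, \beta'\ \mathrm{S1}^{\alpha'}_{\alpha}\ \beta \,.\, \zeta = \mathrm{cl}((\alpha\ \mathrm{id}\ \alpha')\ \mathrm{cd}\ (\beta\ \mathrm{cd}\ \beta'))) \urcorner$.

A75. $\ulcorner \mathrm{AM_C}\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\exists\beta)(\exists\gamma)(\mathrm{Vbl}\ \alpha \,.\, \mathrm{Vbl}\ \beta \,.\, \mathrm{Fmla}\ \gamma \,.\, \beta \neq \alpha \,.\, {\sim}(\beta\ \mathrm{F}\ \gamma) \,.\, \zeta = \mathrm{cl}(\beta\ \mathrm{qe}\ (\alpha\ \mathrm{qu}\ ((\alpha\ \mathrm{e}\ \beta)\ \mathrm{b}\ ((\alpha\ \mathrm{e}\ \mathrm{v})\ \mathrm{cj}\ \gamma))))) \urcorner$.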

The general notion of axiom is now definable. 

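A76. $\ulcorner \mathrm{Ax}\ \zeta \urcorner$ for $\ulcorner (\mathrm{AQ_A}\ \zeta \lor \mathrm{AQ_B}\ \zeta \lor \mathrm{AQ_C}\ \zeta \lor \mathrm{AQ_D}\ \zeta \lor \mathrm{AM_A}\ \zeta \lor \mathrm{AM_B}\ \zeta \lor \mathrm{AM_C}\ \zeta) \urcorner$.
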
Now x is a theorem if it belongs to a sequence of expressions each 
of which is an axiom or a ponential of earlier expressions of the 
sequence. 
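The sequence condition just stated can be checked mechanically. The Python sketch below is a toy illustration under stated assumptions, not a rendering of the protosyntactical definition: formulae are represented as arbitrary hashable Python values, a conditional is the hypothetical tuple `("cd", antecedent, consequent)`, and the test for axiomhood is supplied by the caller as `is_axiom`.

```python
def is_ponential(formula, earlier):
    """True if some g and the conditional ('cd', g, formula) both occur earlier."""
    return any(("cd", g, formula) in earlier for g in earlier)


def is_proof_sequence(sequence, is_axiom):
    """True if every member is an axiom or a ponential of earlier members."""
    for index, formula in enumerate(sequence):
        earlier = sequence[:index]
        if not (is_axiom(formula) or is_ponential(formula, earlier)):
            return False
    return True


def is_theorem_via(formula, sequence, is_axiom):
    """True if `sequence` is a proof sequence in which `formula` occurs."""
    return formula in sequence and is_proof_sequence(sequence, is_axiom)
```

Note that the check confirms theoremhood only relative to a given sequence; finding such a sequence is a separate matter.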

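A77. $\ulcorner \mathrm{Thm}\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\zeta\ \mathrm{Ing}\ \alpha \,.\, (\beta)(\beta\ \mathrm{Ing}\ \alpha \,.\supset.\, \mathrm{Ax}\ \beta \lor (\exists\gamma)(\gamma\ \mathrm{Pr}_{\alpha}\ \beta \,.\, \gamma\ \mathrm{cd}\ \beta\ \mathrm{Pr}_{\alpha}\ \beta))) \urcorner$.


§59. Protosyntax Self-Applied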

IN §§ 55-58 we have considered the protosyntax of logic; let us
now consider the protosyntax of protosyntax itself. Instead of
using the notation of protosyntax to discuss expressions composed
of the alphabet:

w, x, y, z, ', (, ), ↓, ε

we shall now use it to discuss expressions composed of the very
alphabet:

w, x, y, z, ', (, ), ↓, M

in which protosyntax itself is couched. The explanation of
‘Mxyz’ in § 54 accordingly needs to be reconstrued to this slight
extent: the alphabet mentioned under Case 1 ceases to have the
typographical shape ‘ε’ as its ninth member, and comes to have the
shape ‘M’ instead. In other words, S₉ ceases to be ‘ε’ and becomes
‘M’.

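The protosyntactical definition of protosyntactical formula proceeds
just like the protosyntactical definition of logical formula, except
that the atomic formulae are now described as having the form
⌜Mαβγ⌝ instead of ⌜(α ε β)⌝. Thus the definitions A1-30 are taken
over intact except that A23 and A28 give way to:

$\ulcorner \mathrm{m}\{\zeta, \eta, \theta\} \urcorner$ for $\ulcorner (\mathrm{S}_9{}^\frown\zeta^\frown\eta^\frown\theta) \urcorner$,

$\ulcorner \mathrm{PFmla}_0\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\exists\beta)(\exists\gamma)(\mathrm{Vbl}\ \alpha \,.\, \mathrm{Vbl}\ \beta \,.\, \mathrm{Vbl}\ \gamma \,.\, \zeta = \mathrm{m}\{\alpha, \beta, \gamma\}) \urcorner$;

and A31 is taken over with ‘Fmla₀’ and ‘Fmla’ changed to
‘PFmla₀’ and ‘PFmla’.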

Just as the theorems of logic comprised the ponentials of ponen- 
tials of. . . axioms of quantification and axioms of membership, so 
the theorems of protosyntax would comprise the ponentials of 
ponentials of. . . axioms of quantification and various axioms 
appropriate to ‘M’. The protosyntactical definition of protosyntactical
theorem would thus proceed analogously to the protosyntactical
definition of theorem hitherto constructed for logic. We
would first define the axioms of quantification as in § 57; next,
instead of defining axioms of membership as in § 58, we would
define appropriate axioms governing ‘M’; and from here we would
proceed to ‘PThm x’, ‘x is a protosyntactical theorem’, through
definitions precisely parallel to A76-77.

What should we choose by way of axioms governing ‘M’? An 
ideal choice would be such that the theorems would come to exhaust 
those protosyntactical formulae which are true statements, and of 
course exclude all which are false. It turns out, however, that 
such a choice is impossible — so long as we insist on its specifi- 
ability in protosyntactical terms. It turns out that no proto- 
syntactically definable notion of protosyntactical theorem can 
exhaust the protosyntactical truths and exclude the falsehoods. 
The remainder of the section will be devoted to establishing this 
conclusion, which has far-reaching significance. It will be shown 
that if ⌜PThm ζ⌝ is so construed as to be translatable into protosyntax
at all then there is a protosyntactical statement which is
true if and only if it is not a theorem. If the statement in question 
is true then it is not a theorem, so that the theorems do not exhaust 
the protosyntactical truths; and otherwise it is a theorem, so that 
the theorems do not exclude falsehoods. 

In a word, then, our problem is to construct a protosyntactical 
statement to the effect: 

(1) (1) is not a theorem. 

As it stands, the statement (1) refers to itself by means of a de¬ 
monstrative pronoun ‘ (1) ’; but we can gain the same effect without 
this device. If for purposes of a preliminary venture we allow 
appeal to quotation, we can reformulate the self-mentioning state¬ 
ment (1) thus: 

(2) The result of putting the quotation of ‘The result of
putting the quotation of w for S₁ in w is not a theorem’
for S₁ in ‘The result of putting the quotation of w for
S₁ in w is not a theorem’ is not a theorem.

This is more intelligible than it looks. It says that a certain result 
is not a theorem. It says, moreover, how to construct the result
in question. To construct it we are to substitute a certain quotation
for S₁ (not for ‘S₁’ but for S₁, which is the variable ‘w’) in
the matrix:

(3) The result of putting the quotation of w for S₁ in w is
not a theorem.

Moreover, the quotation which we are told to substitute is the quo¬ 
tation of (3) itself—i.e., it is (3) plus an enclosing pair of quota¬ 
tion marks. When we perform this substitution we come out with 
(2) again as our result. So (2) says that (2) is not a theorem. 
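The self-reference claimed for (2) can be verified by carrying out the substitution literally. The Python sketch below is only an illustration with strings: the function names are hypothetical, ‘S1’ is written on the line in place of the subscripted sign, and the curly quotation marks stand in for quotation. It builds (2) from the matrix (3), then reads off from (2) the construction that (2) describes and performs it.

```python
import re

# The matrix (3), with 'w' as its only free variable.
MATRIX = "The result of putting the quotation of w for S1 in w is not a theorem"


def quotation(expression):
    """Enclose an expression in a pair of quotation marks."""
    return "‘" + expression + "’"


def put_for_w(replacement, matrix):
    """Put `replacement` for each standalone occurrence of 'w' in `matrix`."""
    return re.sub(r"\bw\b", lambda _match: replacement, matrix)


def described_result(statement):
    """Perform the substitution that a statement of the form (2) describes."""
    prefix = "The result of putting the quotation of ‘"
    if not statement.startswith(prefix):
        raise ValueError("statement is not of the form (2)")
    # The quoted matrix runs up to the first closing quotation mark.
    inner = statement[len(prefix):statement.index("’")]
    return put_for_w(quotation(inner), inner)


# Statement (2): the quotation of (3) put for 'w' in (3).
STATEMENT_2 = put_for_w(quotation(MATRIX), MATRIX)
```

Applying `described_result` to `STATEMENT_2` yields `STATEMENT_2` again, which is the sense in which (2) says of itself that it is not a theorem.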

When we try to put (2) into the notation of protosyntax, the
problem confronts us of somehow translating or eliminating the two
idioms ‘result of putting’ and ‘quotation of’. Let us deal with the
former of these first. Imagining ‘----’ to be some expression, and
imagining ‘... w ...’ to be some matrix with free ‘w’, we observe
to begin with that the result of putting ‘----’ for S₁ (i.e. for ‘w’)
in ‘... w ...’ is equivalent to:

(w)(w = ---- .⊃ ... w ...)

(cf. *234a). This observation may, with help of the notations
introduced in A25 and A49 (pp. 295, 301), be rephrased as follows:
the result of putting an expression u for free S₁ in a matrix y is
equivalent to S₁ qu (z cd y) where z consists of ‘w =’ followed by u.
Now if in particular we take u as the quotation of some expression 
x, the foregoing observation takes on the following form: 

(4) The result of putting the quotation of x for free S₁
in y is equivalent to S₁ qu (z cd y) where z is ‘w =’
followed by the quotation of x.

Quotation, we know, is extraneous to the notation and subject 
matter of protosyntax. Nevertheless it is possible in protosyntax 
to construct corresponding to any given expression x a matrix which 
as a whole is equivalent to ‘w = ’ followed by the quotation of x. 
How this can be done will be seen later; meanwhile let us suppose 
it done and get on. The matrix in question will be called the 
identity matrix of x, briefly im x. Thus 

(5) im x is equivalent to ‘w = ’ followed by the quotation of x.

By (4), then,

(6) The result of putting the quotation of x for free S₁ in y
is equivalent to S₁ qu (im x cd y).

Accordingly the purpose of (3) will be served just as well by:

(7) ~PThm S₁ qu (im w cd w).

Then (2) becomes:

(8) The result of putting the quotation of ‘~PThm S₁ qu
(im w cd w)’ for S₁ in ‘~PThm S₁ qu (im w cd w)’ is not a
theorem.

Now if in turn we translate (8) as a whole in conformity with (6),
we obtain:

(9) ~PThm S₁ qu (im ‘~PThm S₁ qu (im w cd w)’ cd
‘~PThm S₁ qu (im w cd w)’).

We are imagining that ⌜PThm ζ⌝ has some protosyntactical definition.
Let us also suppose that we have succeeded (as in time we
shall) in devising an appropriate protosyntactical definition for
⌜im ζ⌝. Then (7) can be expanded into the primitive notation of
protosyntax. The expanded formula can in turn be named in protosyntax,
by spelling (cf. page 284). This name, which we may abbreviate
by the letter ‘H’, is as good as a quotation of (7); so (9)
becomes:

(10) ~PThm S₁ qu (im H cd H).

Here, pending our remaining task of defining ⌜im ζ⌝, is a protosyntactical
analogue of (2) or indeed of (1). Here, under abbreviation,
is a protosyntactical formula which is true if and only if
it is not a protosyntactical theorem. The existence of such a for¬ 
mula proves that the theorems either exclude some protosyntactical 
truths or include some falsehoods. The argument holds for any 
protosyntactically definable concept of protosyntactical theorem. 

We have now to fill in the one gap in the argument; namely, to
devise a protosyntactical definition of ⌜im ζ⌝ conforming, for every
protosyntactical expression x, to the requirement (5).

Just as in §§ 57-58 we adopted definitions A47-49, A62-64, and
A66 for convenience in referring to those formulae which the logical
definitions D1-5, D8 and D10 serve to abbreviate, let us now adopt
definitions which will enable us similarly to refer to those
protosyntactical formulae which the protosyntactical definitions A1-5,
A8, and A9 serve to abbreviate. The first six of these new definitions,
introducing ⌜nζ⌝, ⌜ζ al η⌝, ⌜ζ cd η⌝, ⌜ζ cj η⌝, ⌜ζ b η⌝, and
⌜ζ qe η⌝, simply duplicate A47-49 and A62-64; for A1-5 and A8
are the same as D1-5 and D8. The remaining one of our new
definitions is related to A9 as A66 is related to D10; A65 is to be
understood as carried over in preparation.

$\ulcorner (\zeta\ \mathrm{id_p}\ \eta) \urcorner$ for $\ulcorner (\mathrm{f}(\zeta^\frown\eta)\ \mathrm{qu}\ (\mathrm{m}\{\mathrm{f}(\zeta^\frown\eta), \zeta, \zeta\}\ \mathrm{b}\ \mathrm{m}\{\mathrm{f}(\zeta^\frown\eta), \eta, \eta\})) \urcorner$.

Thus, where x and y are variables, x id_p y is the protosyntactical
formula whose definitional abbreviation via A1-9 is ‘(   =   )’
with x and y in the respective blanks.

Now let Z₁ be the protosyntactical formula ‘(x)(y) ~Mwxy’, or
rather the primitive expansion:

(x)(y)(Mwxy ↓ Mwxy)

thereof; i.e.,

‘Z₁’ for ‘(S₂ qu (S₃ qu n m{S₁, S₂, S₃}))’.

Let Z₂ be the primitive expansion of the result of putting Z₁ in the
blank of:

(11) (∃x)((∃w)(x = w . ----) . Mwxw);

i.e.,

‘Z₂’ for ‘(S₂ qe ((S₁ qe ((S₂ id_p S₁) cj Z₁)) cj m{S₁, S₂, S₁}))’.

Similarly let Z₃ be the primitive expansion of the result of putting
Z₂ in the blank of (11); and so on to Z₉. The definitions of ‘Z₃’, ...,
‘Z₉’ are just like the above definition of ‘Z₂’ except for containing
‘Z₂’, ..., ‘Z₈’ in place of ‘Z₁’.

We have noted (§ 54) that (x)(y) ~Mwxy if and only if w = S₁.
Thus the formula Z₁ is equivalent to ‘w = S₁’. Accordingly, putting
‘w = S₁’ in the blank of (11), we conclude that Z₂ is equivalent
to:

(∃x)((∃w)(x = w . w = S₁) . Mwxw),

which is equivalent in turn (cf. *234b) to ‘MwS₁w’ and hence to
‘w = S₂’ (cf. § 54). Putting the latter in the blank of (11), we conclude
in similar fashion that Z₃ is equivalent to ‘w = S₃’; and so
on. Z₁, Z₂, ..., Z₉ are protosyntactical formulae (in primitive
notation) which are equivalent respectively to ‘w = S₁’, ‘w = S₂’,
..., ‘w = S₉’. It is therefore consonant with (5) to explain im Sᵢ
as Zᵢ, for each i from 1 to 9. But there remains the task of formulating
im x where x is longer than a single sign.

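By z cid z', next, will be meant the primitive expansion of the
result of putting z and z' (whatever expressions these may be) in
the respective blanks of:

(12) (∃x)(∃y)((∃w)(x = w . ----) . (∃w)(y = w . ----) . Mwxy .
~Mwxw).

Thus

$\ulcorner (\zeta\ \mathrm{cid}\ \eta) \urcorner$ for $\ulcorner (\mathrm{S}_2\ \mathrm{qe}\ (\mathrm{S}_3\ \mathrm{qe}\ ((((\mathrm{S}_1\ \mathrm{qe}\ ((\mathrm{S}_2\ \mathrm{id_p}\ \mathrm{S}_1)\ \mathrm{cj}\ \zeta))\ \mathrm{cj}\ (\mathrm{S}_1\ \mathrm{qe}\ ((\mathrm{S}_3\ \mathrm{id_p}\ \mathrm{S}_1)\ \mathrm{cj}\ \eta)))\ \mathrm{cj}\ \mathrm{m}\{\mathrm{S}_1, \mathrm{S}_2, \mathrm{S}_3\})\ \mathrm{cj}\ \mathrm{n}\,\mathrm{m}\{\mathrm{S}_1, \mathrm{S}_2, \mathrm{S}_1\}))) \urcorner$.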

Consider, e.g., Zₕ cid Zₖ (supposing h and k chosen from among 1, 2,
..., 9). Since Zₕ and Zₖ are equivalent respectively to ‘w = Sₕ’
and ‘w = Sₖ’, Zₕ cid Zₖ is equivalent to the result of putting
‘w = Sₕ’ and ‘w = Sₖ’ in the respective blanks of (12). But this
reduces to ‘MwSₕSₖ . ~MwSₕw’ (cf. *234b), which is equivalent
in turn to ‘w = Sₕ⌢Sₖ’ (cf. § 54). Thus Zₕ cid Zₖ is equivalent to
‘w = Sₕ⌢Sₖ’.¹ By the same reasoning, again, (Zₕ cid Zₖ) cid Zₘ is
seen to be equivalent to ‘w = Sₕ⌢Sₖ⌢Sₘ’; likewise ((Zₕ cid Zₖ)
cid Zₘ) cid Zₙ is equivalent to ‘w = Sₕ⌢Sₖ⌢Sₘ⌢Sₙ’; and so on. In
general, $(\ldots((\mathrm{Z}_{q_1}\ \mathrm{cid}\ \mathrm{Z}_{q_2})\ \mathrm{cid}\ \mathrm{Z}_{q_3}) \ldots)\ \mathrm{cid}\ \mathrm{Z}_{q_r}$ is a protosyntactical
formula equivalent to ‘$w = \mathrm{S}_{q_1}{}^\frown\mathrm{S}_{q_2}{}^\frown \ldots {}^\frown\mathrm{S}_{q_r}$’, and hence exactly fits
the characterization (5) of im x when x is $\mathrm{S}_{q_1}{}^\frown\mathrm{S}_{q_2}{}^\frown \ldots {}^\frown\mathrm{S}_{q_r}$. Since each
expression in protosyntax is $\mathrm{S}_{q_1}{}^\frown\mathrm{S}_{q_2}{}^\frown \ldots {}^\frown\mathrm{S}_{q_r}$ for one or another
choice of $q_1, q_2, \ldots, q_r$ (where $r \geq 1$ and $1 \leq q_i \leq 9$ for each i),
we have now succeeded in identifying im x with a determinate
protosyntactical expression for every determinate choice of x. We
explain im Sᵢ as Zᵢ for each i, and im(Sᵢ⌢Sⱼ) as Zᵢ cid Zⱼ for each
i and j, and im(Sᵢ⌢Sⱼ⌢Sₖ) as (Zᵢ cid Zⱼ) cid Zₖ for each i, j, and
k, and so on.

¹ The connective ‘cid’ is supposed to suggest the words ‘concatenation’ and
‘identity.’
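
The case-by-case explanation of im x amounts to a left-nested fold over the signs of x. The Python sketch below is a hedged illustration, not the protosyntactical definition sought in the text: it takes x as a hypothetical list of sign indices between 1 and 9 and returns a string that names the corresponding formula by means of ‘Z1’, ..., ‘Z9’ and ‘cid’, without expanding anything into primitive notation.

```python
def im_name(sign_indices):
    """Name im x for x = S_q1 followed by S_q2 ... followed by S_qr.

    `sign_indices` is an assumed representation: the list [q1, ..., qr].
    Every 'cid' compound is parenthesized, including the outermost one.
    """
    if not sign_indices:
        raise ValueError("an expression has at least one sign")
    if any(not 1 <= q <= 9 for q in sign_indices):
        raise ValueError("sign indices must lie between 1 and 9")
    name = f"Z{sign_indices[0]}"
    for q in sign_indices[1:]:
        # Each further sign wraps the formula built so far on the left.
        name = f"({name} cid Z{q})"
    return name
```

Because the fold proceeds one sign at a time, it handles each determinate x separately, which is why the text goes on to seek a single definition covering unspecified x.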

But there still remains the further task of devising a protosyntactical
definition of ‘im x’ where x is unspecified. In seeking this
objective it will be convenient to concentrate not on im x itself
but on im x⌢x, the concatenate of im x with x. From im x⌢x as a
whole then we can single out im x afterward as the initial
protosyntactical formula thereof; for a survey of our primitive notation
reveals that a string of signs cannot have two formulae as initial
segments, when the full parentheses appropriate to joint denial are
taken into consideration.

Let us write ‘IMC w’ to mean that w is im x⌢x for some x. To see
how to express this in protosyntax, we reflect that ‘IMC w’ is to
hold if and only if w is either Zᵢ⌢Sᵢ for some i, or (Zᵢ cid Zⱼ)⌢Sᵢ⌢Sⱼ for
some i and j, or ((Zᵢ cid Zⱼ) cid Zₖ)⌢Sᵢ⌢Sⱼ⌢Sₖ for some i, j, and
k, or etc. In other words, IMC w if and only if w is one of a
sequence of expressions each of which is either Zᵢ⌢Sᵢ for some i or
else (z cid Zᵢ)⌢y⌢Sᵢ for some i where z⌢y is an earlier expression of
the sequence (and z is a formula). Accordingly, applying our usual
sequence technique,

$\ulcorner \mathrm{IMC}\ \zeta \urcorner$ for $\ulcorner (\exists\alpha)(\zeta\ \mathrm{Ing}\ \alpha \,.\, (\beta)(\beta\ \mathrm{Ing}\ \alpha \,.\supset:\, \beta = \mathrm{Z}_1{}^\frown\mathrm{S}_1 \,.\lor.\, \ldots \,.\lor.\, \beta = \mathrm{Z}_9{}^\frown\mathrm{S}_9 \,.\lor\, (\exists\gamma)(\exists\delta)(\mathrm{PFmla}\ \gamma \,.\, \gamma^\frown\delta\ \mathrm{Pr}_{\alpha}\ \beta \,:\, \beta = (\gamma\ \mathrm{cid}\ \mathrm{Z}_1)^\frown\delta^\frown\mathrm{S}_1 \,.\lor.\, \ldots \,.\lor.\, \beta = (\gamma\ \mathrm{cid}\ \mathrm{Z}_9)^\frown\delta^\frown\mathrm{S}_9))) \urcorner$.¹

Now the definition of ⌜im ζ⌝ is obvious.

$\ulcorner \mathrm{im}\ \zeta \urcorner$ for $\ulcorner (\iota\alpha)(\mathrm{PFmla}\ \alpha \,.\, \mathrm{IMC}\ \alpha^\frown\zeta) \urcorner$.

¹ Obviously the dots ‘...’ here mark no breach of rigor. The reader can write
in the fourteen missing clauses.


§ 60. Incompleteness

WE HAVE seen that protosyntax is protosyntactically incompletable,
in the sense that no protosyntactically definable notion of
protosyntactical theorem can exhaust the protosyntactical truths and
exclude the falsehoods. Protosyntactical truth is protosyntactically
indefinable. Where K is the class of all those statements which
are built up of atomic matrices ⌜Mαβγ⌝ by joint denial and
quantification and are in fact true under the intended interpretation of
‘M’, ‘↓’, etc., no matrix ‘... x ...’ composed of those same
materials will fulfill the condition:

(1) (x)(... x ... .≡. x ε K).

In the course of the present section we shall find that logic itself 
is likewise protosyntactically incompletable; that no protosyn- 
tactically definable notion of logical theorem can exhaust the 
logical truths and exclude the falsehoods. Logical truth, like proto- 
syntactical truth, is protosyntactically indefinable. 

‘M’ was explained (§ 54) in a fashion equivalent to the following: 
Mxyz if and only if 

(i) x and y are respectively ‘x’ and ‘w’, or ‘y’ and ‘x’, or ...
(through the nine-sign alphabet), or else

(ii) y and z are expressions (composed of those nine signs) and x 
consists of y followed by z, or else 

(iii) x is not an expression and y is x. 

Now obviously the truth of the statements in K is not contingent 
upon geometrical peculiarities of the particular typographical 
shapes ‘w’, ‘x’, ‘y’, V, ‘(’, etc. which were chosen for purposes 
of (i); any other shapes, e.g. the numerals ‘1’, ‘2’, . . . , ‘9’, would 
serve as well. Indeed, we need not even insist upon typographical 
shapes. Instead of considering the shapes ‘1’, ‘2’, . . . , ‘9’ and 
their complexes ‘11’, ‘12’, . . . , ‘99’, ‘111’, etc., we could just as 
well consider the corresponding natural numbers, 1, 2, . . ., 9, 11, 
12, ... , 99, 111, . . . themselves. 
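The passage from expressions to numbers can be made concrete. The Python sketch below is an illustration under an assumption: it takes the nine signs in the order w, x, y, z, ', (, ), ↓, M (the list `ALPHABET` is hypothetical so far as this code is concerned), maps the i-th sign to the digit i, and reads an expression as the number whose Arabic numeral is the resulting string of digits.

```python
# Assumed ordering of the nine signs S1, ..., S9.
ALPHABET = ["w", "x", "y", "z", "'", "(", ")", "↓", "M"]


def to_number(expression):
    """Map a non-empty string of signs to the corresponding natural number."""
    if not expression:
        raise ValueError("an expression has at least one sign")
    # ALPHABET.index raises ValueError for a character outside the alphabet.
    digits = "".join(str(ALPHABET.index(sign) + 1) for sign in expression)
    return int(digits)


def to_expression(number):
    """Map a natural number with no '0' in its numeral back to its expression."""
    digits = str(number)
    if number <= 0 or "0" in digits:
        raise ValueError("only numbers written without '0' correspond to expressions")
    return "".join(ALPHABET[int(digit) - 1] for digit in digits)
```

Under this correspondence, concatenating two expressions answers to writing one numeral after the other, which is the operation the arithmetical analogue of ‘M’ relies on.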

We are thus led to an arithmetical analogue of the protosyntactical
‘M’. By way of notation for the analogue, let us use the
letter ‘ε’ in place of ‘M’. This sense of ‘ε’ has nothing to do with
the membership sense of ‘ε’; but there is no danger of confusion,
since ‘ε’ in the new sense has contexts of the form ⌜εζηθ⌝ while ‘ε’
in the membership sense has contexts rather of the form ⌜(ζ ε η)⌝.
These new contexts are explicable in direct analogy to (i)-(iii)
above: εxyz if and only if

(iv) x and y are respectively 2 and 1, or 3 and 2, or ..., or 9 and
8, or else

(v) y and z are perpositive numbers (natural numbers in whose
Arabic numerals ‘0’ does not appear) and x is y⌢z (cf. end of § 49),
or else

(vi) x is not a perpositive number and y is x. 

Where K' is the class of all those true statements which are built up
of matrices ⌜εαβγ⌝ by joint denial and quantification, the only
difference between K' and the previously described class K is that
the statements of the one class contain ‘ε’ where the statements of
the other class contain ‘M’.
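
The three clauses (iv)-(vi) can be transcribed directly into a decision procedure. The Python sketch below is an illustration, not the logical definition given next in the text; the names `is_perpositive`, `concat`, and `eps` are hypothetical, and y⌢z is read as the number whose numeral is the numeral of y followed by that of z.

```python
def is_perpositive(n):
    """True if n is a natural number whose Arabic numeral contains no '0'."""
    return isinstance(n, int) and n > 0 and "0" not in str(n)


def concat(y, z):
    """The number whose numeral is the numeral of y followed by that of z."""
    return int(str(y) + str(z))


def eps(x, y, z):
    """The arithmetical analogue of Mxyz, following clauses (iv)-(vi)."""
    # (iv) x and y are 2 and 1, or 3 and 2, ..., or 9 and 8 (z is unrestricted).
    if 1 <= y <= 8 and x == y + 1:
        return True
    # (v) y and z are perpositive and x is y followed by z.
    if is_perpositive(y) and is_perpositive(z) and x == concat(y, z):
        return True
    # (vi) x is not a perpositive number and y is x.
    if not is_perpositive(x) and y == x:
        return True
    return False
```

One can observe, for instance, that no y and z satisfy `eps(1, y, z)`, in parallel with the remark of § 59 that (x)(y) ~Mwxy if and only if w = S₁.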

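This new idiom ‘εxyz’ is readily given a logical definition. To
the series of logical definitions D1-48 we first add the ones suggested
at the end of § 49, together with the following definition of
the class of perpositive numbers.

D61. ‘Pp’ for ‘$\hat{x}(x \,\epsilon\, \mathrm{Nn} \,.\, (y)(x \neq y^\frown 0 \,.\, (z)(x = (z^\frown 0)^\frown y \,.\supset.\, z = 0)))$’.

Then we simply reproduce (iv)-(vi) as a formal definition.

D62. $\ulcorner \epsilon\zeta\eta\theta \urcorner$ for $\ulcorner (8 \geq \eta \,.\, \eta \geq 1 \,.\, \zeta = \mathrm{S}`\eta \,.\lor.\, \eta, \theta \,\epsilon\, \mathrm{Pp} \,.\, \zeta = \eta^\frown\theta \,.\lor.\, \zeta \,\tilde{\epsilon}\, \mathrm{Pp} \,.\, \zeta = \eta) \urcorner$.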

Let us return now to the protosyntax of logic, begun in §§ 55-58.
The definitions A47-49, A62-64, and A66-71 were set up in such a
way as to parallel D1-5, D8-10, D12, and D15; and by continuing
the series in obvious fashion we finally reach a protosyntactical
definition:

⌜e{ζ, η, θ}⌝ for ...

paralleling D62. Where x, y, and z are any variables, e{x, y, z} is
the logical formula (built up of matrices ⌜(α ε β)⌝ by joint denial
and quantification) which D1-62 abbreviate as ‘ε’ followed by
the successive variables x, y, and z (whichever ones they may be).

Suppose now that y is a logical formula (in primitive notation, of
course), and that x is an abbreviation of y by D1-62. It may
happen that x contains only primitive notation plus abbreviations
of the form ⌜εαβγ⌝ explained by D62; x may be devoid of other
abbreviation notation. When all these circumstances obtain, let
us call x an epsilon abbreviate of y; symbolically x EA y. Where
x EA y, thus, x consists only of the alphabet S₁, ..., S₉, i.e. ‘w’,
‘x’, ..., ‘↓’, ‘ε’, whereof the protosyntax of logic speaks; but
this does not make x a logical formula, for the mode of combination
of signs which is essential to a logical formula is departed from in
⌜εαβγ⌝. At the same time x is an abbreviation, via D1-62, of the
genuine logical formula y.

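To say that x is an epsilon abbreviate of y is to say that x is one
of a sequence of expressions each of which either is y or else is
formed from an earlier expression of the sequence by abbreviating a
part e{z, z', z''} as S₉⌢z⌢z'⌢z'' (z, z', and z'' being any variables).
The protosyntactical definition of this notion of epsilon abbreviate
is then readily accomplished, following our usual method of dealing
with sequences.

$\ulcorner (\zeta\ \mathrm{EA}\ \eta) \urcorner$ for $\ulcorner (\exists\alpha)(\zeta\ \mathrm{Ing}\ \alpha \,.\, (\beta)(\beta\ \mathrm{Ing}\ \alpha \,.\supset:\, \beta = \eta \,.\lor\, (\exists\gamma)(\exists\delta)(\exists\delta')(\exists\delta'')(\mathrm{Vbl}\ \delta \,.\, \mathrm{Vbl}\ \delta' \,.\, \mathrm{Vbl}\ \delta'' \,.\, \gamma\ \mathrm{Pr}_{\alpha}\ \beta \,.\, \beta\ \mathrm{S1}^{\mathrm{S}_9{}^\frown\delta^\frown\delta'^\frown\delta''}_{\mathrm{e}\{\delta, \delta', \delta''\}}\ \gamma))) \urcorner$.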

Under this definition, ‘xEAy’ is true also in the limiting case 
where x and y are the same expression; but this departure is of no 
consequence. 

Let us call x a quasi-theorem, symbolically Thm' x, if x is built up
solely of expressions ⌜εαβγ⌝ by joint denial and quantification, and
is furthermore an abbreviation by D1-62 of a theorem of logic.
This notion is readily defined in protosyntax, given any proto¬ 
syntactical definition of theorem (e.g. that of § 58). 

r Thm'r for r ((3a)(Thm a . fEAa) , ~(3/3)(Fmla 0. 

Suppose now that the protosyntactical definition of ⌜Thm ζ⌝ is
such as to include among the theorems all those logical formulae 
which are true, and exclude all falsehoods. Then an expression x 
will be a quasi-theorem, in the above sense, if and only if it belongs 
to the class K' described earlier. Thus if we expand ‘Thm'x’ into 
protosyntactical primitives we have a protosyntactical matrix 
‘... x ...’ such that

(2) (x)(... x ... .≡. x ε K').

Whereas the protosyntactical ‘M’ (occurring in ‘... x ...’) has
thus far been construed in the fashion appropriate to the protosyntax
of logic, i.e. in such fashion that S₉ = ‘ε’, let us now reconstrue
it in the fashion appropriate rather to the protosyntax
of protosyntax (§ 59); i.e., in such fashion that S₉ becomes ‘M’.
The matrix ‘... x ...’ then comes to satisfy the condition (1)
rather than (2), since the expressions belonging to K' go over into
those of K when ‘ε’ is changed to ‘M’. But we know that no
protosyntactical matrix ‘... x ...’ can satisfy (1). The initial
assumption of the preceding paragraph is therefore false; no proto¬ 
syntactical definition of theorem can include all true logical 
formulae among the theorems and exclude the falsehoods. Logic, 
like protosyntax itself, is protosyntactically incompletable. 

It follows in particular that the notion of theorem which was 
developed in earlier chapters, and defined protosyntactically in 
§ 58, does not accord the status of theorem to all those logical 
formulae which are true statements — or else, worse, that it accords 
the status of theorem to some falsehoods. And, as the foregoing 
argument shows, any alternative notion of theorem which we might 
devise will suffer a similar fate, so long as we insist on protosyn¬ 
tactical definability. Nor is the demand of protosyntactical 
definability easily waived; it seems already to be more liberal than 
the normal practical demand of constructivity — the demand that 
for each theorem there exist an at least fortuitously discoverable 
method of confirming its theoremhood (cf. § 55). 

The protosyntactical incompletability of logic is of more imme¬ 
diate concern to us than was the protosyntactical incompletability 
of protosyntax; but the latter is more remarkable than the former, 
inasmuch as protosyntax has a far more limited subject matter than 
logic. Apprised only of the protosyntactical incompletability of 
logic, we should probably have blamed the difficulty on the connective
‘ε’ of membership and hence questioned the admissibility
of that connective. The notion of membership is a natural object 
of suspicion; for it is this notion that imports the whole realm of 
classes of higher and higher orders of abstractness, and even calls 
for ad hoc measures such as the distinction between element and 
non-element for the avoidance of contradiction. Protosyntax 
itself, on the other hand, is wholly independent of the notion of 
membership and the theory of classes; it calls for no non-elements, 
indeed no entities whatever beyond an infinite domain of finite
expressions each of which is nameable within the notation of
protosyntax in systematic fashion. Alternatively, we have seen, those
entities can be construed as natural numbers. When protosyn- 
tactical incompletability reasserts itself in as simple a field as this, 
we cease to regard such incompletability as a ground of suspicion 
and come rather to expect it in every fairly untrivial field. 

Protosyntax, though protosyntactically incompletable, can 
indeed be shown to be syntactically completable. I.e., whereas no 
matrix constructed solely of atomic matrices ⌜Mαβγ⌝ by joint
denial and quantification will fulfill (1) when put for the blank
‘... x ...’, introduction of additional atomic matrices of the form
⌜(α ε β)⌝ does enable us to construct a matrix which will fulfill
(1) when put for ‘... x ...’. On the other hand syntax itself,
embracing membership in addition to the resources of protosyntax,
is not even syntactically completable. This can be shown by
applying syntax to itself just as protosyntax was applied to itself
in § 59. (For this purpose the alphabet implicit in the interpretation
of ‘M’ is of course reconstrued as embracing both ‘ε’ and
‘M’, as S₉ and S₁₀.) Whereas protosyntactical truth is syntactically
but not protosyntactically definable, syntactical truth is not
even syntactically definable.

Protosyntactical definability was thought of as moderately 
broader than constructivity, and syntactical definability was 
thought of as corresponding to the much broader property of 
formality (cf. §§ 53, 55). Insofar as this way of thinking does 
justice to the intent of the vague terms ‘constructive’ and ‘formal’, 
then, we must conclude that protosyntactical truth is a non¬ 
constructive formal notion and that syntactical truth is not formal 
at all. Whoever finds the latter conclusion intolerable had best 
explain his sense of ‘formal’. 

The argument from the protosyntactical incompletability of 
protosyntax to the protosyntactical incompletability of logic 
turned essentially upon the fact that a set of entities behaving like 
the objects of protosyntax — in short, a model of protosyntax — 
is discoverable among logically definable objects; viz., in the 
arithmetic of natural numbers. Now it is possible similarly, by 
exhibiting a logical model of syntax, to argue from the syntactical 
incompletability of syntax to the syntactical incompletability of
logic. Logical truth, like syntactical truth, is syntactically indefinable.
Logical truth may, subject to the cautions of the preceding
paragraph, be said to be informal.

Thus even if we go so far as to waive the demand of protosyntactical
definability in order to come by a notion of logical theorem
which will cover all logical truths and exclude all falsehoods, the
outlook remains dark; we should have to renounce syntactical
definability as well. Indeed, a notion of theorem capable of exhausting
those logical formulae which are true and excluding those
which are false will be definable only in a medium so rich and
complex as not to admit of a model anywhere in the reaches of
the theory of logic which is under investigation. An exhaustive
formulation of logical truth which carries general recognizability
with it, even of the most tenuous sort, is not to be aspired to.
We must in practice rest content with one or another notion of 
theorem which covers an important subclass of the logical truths 
and does not encroach upon the falsehoods. 


The fact that logic generally and the elementary theory of natural numbers in
particular is incompletable by constructive means (cf. § 55) was discovered and
established by Gödel (1931); and it is to him that the essential idea behind the
technique of self-application (§ 59) is due. The protosyntactical incompletability of
protosyntax and the syntactical incompletability of logic are cases of a more sweeping
observation which is due to Tarski (“Wahrheitsbegriff,” pp. 370ff; also “Undecidable
Statements”).



APPENDIX 

Theorem versus Metatheorem 


Many readers of the first edition have expressed uneasiness over 
the informality of the proofs of metatheorems such as *111. Some 
have even sensed here a circularity of reasoning. Thus, it is argued,
the proof of *111 makes implicit use of the principles of
mathematical induction and the substitutivity of identity; yet 
these principles are established only in *222 and *636, on the basis 
of *111 itself. 

These misgivings turn on a confusion between theorem and metatheorem
which a more careful reading would probably clear up.
The matter has proved troublesome enough, however, to warrant 
some supplementary exposition at this point. 

By a formal deduction let me be understood to mean any list of 
formulas such that every formula of the list is either an axiom of 
quantification (or of membership, after § 29) or a ponential of two 
earlier formulas of the list. Thus each single axiom is itself a short 
formal deduction, one line long. But none of the so-called proofs 
in the book are formal deductions; indeed the only formal deduction 
appearing anywhere in the book, apart from single axioms, is the 
sample at the middle of page 86. 
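
The definition just given is mechanical enough to be checked by a short program. What follows is only an illustrative sketch in Python, not anything drawn from the book: formulas are modelled as nested tuples, the helper names cond, is_ponential and is_formal_deduction are hypothetical, and a two-member toy axiom set stands in for the axioms of quantification and membership.

```python
# Illustrative sketch; the tuple encoding and the toy axioms are assumptions.

def cond(antecedent, consequent):
    """Represent the conditional 'antecedent ⊃ consequent' as a tagged tuple."""
    return ("cond", antecedent, consequent)


def is_ponential(formula, earlier):
    """True if `earlier` contains some X and also the conditional 'X ⊃ formula'."""
    return any(cond(x, formula) in earlier for x in earlier)


def is_formal_deduction(lines, is_axiom):
    """True if each line is an axiom or a ponential of two earlier lines."""
    return all(
        is_axiom(formula) or is_ponential(formula, lines[:i])
        for i, formula in enumerate(lines)
    )


# Hypothetical toy axiom set, standing in for the real axiom schemata.
p, q = "p", "q"
toy_axioms = {p, cond(p, q)}


def is_toy_axiom(formula):
    return formula in toy_axioms


print(is_formal_deduction([p], is_toy_axiom))                 # True: one axiom
print(is_formal_deduction([p, cond(p, q), q], is_toy_axiom))  # True: q is a ponential
print(is_formal_deduction([q], is_toy_axiom))                 # False: q has no support
```

The first call reflects the remark that a single axiom is itself a formal deduction one line long. The checker never asks whether a line is true, only whether it is licensed by an axiom or by modus ponens.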

Now a theorem, as the term is used in this book, is no more nor 
less than a statement that can be reached as a line of a formal deduction
(cf. page 86). To show that a statement is a theorem it is
sufficient, therefore, to write down a formal deduction which terminates
with the statement in question.

But we also may know that a statement is a theorem without 
ever seeing a formal deduction of it. To know that a statement is a 
theorem is to know that the statement possesses a formal deduction 
which could be written down. 

Suppose, e.g., that we have two formal deductions before us, and
that their respective last lines have the forms:

(1)    (y)(x)(---)
       (y)(x)(w)(--- ⊃ ....),

where the matrix represented as ‘---’ is the same in both cases.

Now it is true, but not obvious, that any two such deductions can
be combined and pieced out with a few more lines so as to yield a
third formal deduction whose last line is ‘(y)(w)(....)’, supposing
‘y’ and ‘w’ to be the free variables of the matrix represented as
‘....’. Once we have assured ourselves of the unobvious fact that
such a third formal deduction is constructible in all such cases, we
can thenceforward always leap directly from a knowledge of the
theoremhood of two statements of type (1) to a knowledge of the
theoremhood of the corresponding statement ‘(y)(w)(....)’. Time
and space are thereby saved.

Now the unobvious fact spoken of above is what *111 affirms.
*111 says of all formulae φ and ψ that if the closures of ⌜φ ⊃ ψ⌝ and
φ possess formal deductions, so does the closure of ψ. Because the
fact is not obvious, we have to convince ourselves of it; and this is
the purpose of the so-called proof on pages 90-91. It is satisfactory
in this proof to avail ourselves of anything that we know in
fact to be true, such as that the closure of ⌜φ ⊃ ψ⌝ is the closure
of ⌜(α)(φ ⊃ ψ)⌝ (where α is alphabetically the earliest free variable),
or such as that if 0 has some property and the number after each
number having the property has the property then every positive
integer has the property. It is satisfactory to avail ourselves thus
of anything we know to be true because we are not seeking a formal
deduction; we are merely seeking the truth about formal deductions.
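
One way to picture a metatheorem of this kind is as a procedure that, given formal deductions of the premisses, returns a formal deduction of the conclusion. The Python sketch below covers only the degenerate case in which the formulas have no free variables, so that each closure is the formula itself and a single step of modus ponens suffices. The genuinely unobvious part of *111, the handling of the initial quantifiers of the closures, is deliberately left out, and the tuple encoding and the name combine are hypothetical.

```python
# Sketch of the quantifier-free case only; not a rendering of the proof of *111.

def cond(antecedent, consequent):
    """Represent the conditional 'antecedent ⊃ consequent' as a tagged tuple."""
    return ("cond", antecedent, consequent)


def combine(deduction_of_conditional, deduction_of_antecedent):
    """Given deductions ending in 'X ⊃ Y' and in 'X', return one ending in 'Y'."""
    conditional = deduction_of_conditional[-1]
    antecedent = deduction_of_antecedent[-1]
    is_matching_conditional = (
        isinstance(conditional, tuple)
        and len(conditional) == 3
        and conditional[0] == "cond"
        and conditional[1] == antecedent
    )
    if not is_matching_conditional:
        raise ValueError("last lines must have the forms 'X ⊃ Y' and 'X'")
    # Both earlier deductions are kept whole; the new last line is a ponential
    # of their two last lines.
    return deduction_of_conditional + deduction_of_antecedent + [conditional[2]]


combined = combine([cond("p", "q")], ["p"])
print(combined[-1])  # q
```

Knowing that such a procedure always succeeds is what licenses the leap from the theoremhood of the premisses to the theoremhood of the conclusion without the combined deduction ever being written out.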

The worry over a circular presupposition of *222 or *636, in the
argument of *111, involves a misunderstanding of the content of
*222 and *636. These metatheorems say that the closures of certain
conditionals are theorems, i.e., possess formal deductions.
These facts are not obvious, and no knowledge of them is presupposed
in proving *111. The most that is presupposed there is that
the closures of conditionals like those concerned in *222 and *636
are true; and this much is too evident to require support.

It must be emphasized that common-sense reasoning is never
smuggled into the formal deductions themselves; formal deductions
proceed from axioms by means exclusively of modus ponens.

Metatheorems are informal observations about theoremhood — 
hence about the possibility of formal deductions. But it must be 
noted that even in the case of actual theorems, e.g. †182-†185, the
“proofs” which are given in the book are on a par with the proofs
of metatheorems; none are formal deductions, each is merely an
argument to show that an appropriate formal deduction could be
obtained. This is why metatheorems can quite properly be cited
in the proofs of theorems. In the case of †182, as it happens, a
formal deduction could also very easily be presented, consisting
merely of †182 itself (expanded into primitive notation); for this is
actually an axiom of quantification of kind A (see page 81). In
the case of †183-†185 and most other theorems stated in the book,
however, the formal deductions would run to considerable lengths. 

As long as we are interested only in formalizing logic and not in 
formalizing our discourse about the formalization of logic, the 
proofs of metatheorems and theorems presented are open to no 
objection on the score of informality. All these proofs are informal 
arguments regarding the existence of formal deductions. But 
there is reason for interest also in formalizing our discourse about 
the formalization of logic; and this topic is indeed dealt with, in 
certain of its aspects, in Chapter VII. 

The definitions D1-48, introduced in the course of §§ 9-48, are
assembled here for convenience of reference. The numerous
definitions of §§ 49-59, not having been used in formal proofs, are

suggested in the course of earlier discussion (pp. 49, 141, 201, 208,

Various of the theorems and metatheorems are assembled here
for convenience of reference. A bookmark at this point will prove 
useful. 

The plan has been to include a theorem or metatheorem in the 
list just in case it is cited by number at least twice in proofs subsequent
to the section in which it made its own appearance. Hence
the reader should not use this list in looking up numbers which are 
close to the number of the theorem or metatheorem whose proof 
he is examining; instead he should merely glance back over 
neighboring pages. In cases where the theorem or metatheorem 
sought is missing from the list because of not having been used 
twice, the reader will indeed consult the list and be disappointed; 
but from the list he will learn the page numbers of neighboring 
theorems and metatheorems, thus expediting his consultation of 
the text. 

Initial universal quantifiers have for brevity been omitted in 
listing theorems. 

Positive number 276f.

Potentiality 16. 

Power: arithmetical 259, 261f, 264f, 270, 274f, 277f; relative
253ff, 259, 265.

Predecessor 252. 

Predicate 27ff, 119, 149, 151. 

Predication, relational 200f, 227. 

Prefix, see Quantifier. 

Primitive notation 40f, 47, 75, 100, 
133, 143, 149ff. 

Primitives, reduction of 2f, 5ff, 43ff, 
49, 72f, 126, 288ff. 

Priority 296. See also Alphabetic 
order. 

Product: arithmetical 259ff, 268ff, 
274, 277; logical 179ff; relative 
213f. 

Projection, see Arrow projection; 
also Image. 

Proof 5f, 319ff. 

Proof notation 91ff, 101ff, 129,
138f, 145, 173f, 176f, 182, 204,
262f.

Proper ancestral 220f. 

Proper names 149ff. 

Property 119f, 237. 

Proposition 32. See also Statement.

Propositional function 121. See 
also Property; Statement matrix. 

Protasis 14. 

Protosyntax 291ff, 306ff, 314f. 

Quantification 18, 55, 67ff, 285; 
axioms of 80ff, 87ff, 300f; existential
101ff, 303; universal
102; vacuous 74, 82, 88, 104. 

Quantificational diagrams 70, 76. 

Quantifier 67ff, 142; confinement 
of 108f; distribution of 82, 88, 
105ff; insertion of 139f; permutation
of 89, 95, 104; scope of
76f. 

Quasi-atomic context 140ff, 167,
294. 

Quasi-quotation 33ff, 110, 287, 290. 
Quasi-theorem 315. 

Quotation 23ff, 33ff, 290. 

Quotient 269f, 275ff.

Range of functionality 222ff, 228. 
Ratio 25, 266ff, 271ff.

Rational 272. See also Ratio. 

Real number 271ff. 

Real variable, see Free variable. 
Recursive 86, 292, 318. 

Reduction sentences 33. 

Reflexivity 56, 61, 138, 185, 232. 
Relational part 205ff. 

Relations 197ff; abstraction of 202ff,
208; predication of 200ff, 227; 
theory of 61, 127, 279. 

Relative: power 253ff, 259, 265; 
product 213f. 

Relettering 109ff, 159, 177. 

Reverse, see Converse. 

Root 275f. 

Russell’s class 128ff, 157, 179, 249.

Scope 16, 40, 190, 217, 227. 
Self-augment 246ff. 

Self-membership 125, 128ff, 157, 
175, 179, 249. 

Semantic 24f. 

Sequence 295f, 299. 

Set 127. See also Class; Counter 
set; Truth set. 

Sign 283ff, 288; and object 23ff, 
283ff. 

Signed real numbers 276f. 
Single-valuedness 222. 

Singulary 13; scope of such operators
16, 190, 217, 227.
Specification 82ff, 88, 151, 171f. 
Spelling 284. 

Star, see Asterisk. 

Statement 11ff, 71ff, 79, 298; logical 125.

Statement connective 27ff. 

Statement matrix 71ff, 79, 298. 

Statement predicate 27ff. 

Stratification 155ff, 164, 302f; of 
matrices 159f; test of 158, 238, 
302f. 

Strict implication 32. 

Subjunctive 16, 29, 33. 

Subscripts 35, 67, 254, 269, 275, 
278. 

Substitution 170ff, 301ff. 

Substitutivity: of biconditional 96ff; of identity 160f, 167ff, 174.

Subtraction 276f. 

Successor 237ff, 252f, 259ff. See 
also Alphabetic order. 

Sum: arithmetical 259ff, 268f, 274f, 277; logical 180ff.

Symmetry 138, 232. See also Commutativity.

Syntax 3ff, 283ff. 

Tautology 50ff, 55, 86ff, 93, 199, 
299. 

Term 119ff, 135f, 145, 152, 294. 

Testability 5f, 81, 86ff, 156, 291f.

Theorem 6, 85ff, 127, 155f, 162f,
291f, 318ff; notation of 137, 162;
protosyntactical 306f; protosyntactical
definition of 305, 315f.

Tilde 14, 140. 

Total reflexivity 138, 232. 

Traditional logic 1, 7, 189, 210. 

Transitivity 58, 61, 138, 185, 232. 

Transposition 57, 186f. 

Truth 1f, 4, 27ff; logical 1ff, 28,
30, 50, 80ff, 86, 313, 315, 318;
protosyntactical 313, 317; syntactical
317.

Truth-functional: component 52ff, 
299; composition 11ff, 29, 33,
42ff. 

Truth sequence 299. 

Truth set 53ff, 299. 

Truth table 11ff, 19ff, 42ff, 51ff.

Truth value 11ff.

Two 237ff, 267, 275. 

Types, theory of 163ff. 

Unit class 135f, 189ff, 241. 

Universal class 144f, 163, 183f, 
188, 304f; infinitude of 248, 251ff.
Universal quantification 102. See 
also Quantification. 

Universal relation 205f. 

Unnamable numbers 273f. 

Unsigned real numbers 276. 

Upper bound 272f. 

Use and mention 23ff, 283ff.


Vacuous: occurrence 1f, 28, 50;
quantification 74, 82, 88, 104. 

Value: of a function 222; of a 
variable 34f, 287. 

Variables 34f, 68ff, 75, 109f, 283ff, 
287, 300; as pronouns 3, 5, 68, 
71; collision of 74f, 110ff; elimination
of 71; rewriting 109ff,
133f, 159, 177; values of 34f, 287. 

Whole number 267, 270, 275f. See 
also Natural number. 

Zero 237ff, 267ff, 275f. 



